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1 

e . AN ENZYME EXHIBITING CELLULASE ACTIVITY 

FIELD OF THE INVENTION 

5 The present invention relates to an enzyme exhibiting cellulase 
activity, a DNA construct encoding the enzyme , a cellulolytic 

* agent comprising the enzyme and a detergent composition 
containing the enzyme. 

10 BACKGROUND OF THE INVENTION 

Biomass which largely consists of cellulose, hemicellulose and 
lignin has attracted increasing attention as an important 
renewable source of energy (including nutritional energy) . The 

15 amount of carbon fixed by photosynthesis has been estimated to 
be 100xl0 9 tons per year worldwide, and half of that is contained 
in cellulose. If this material, or at least a significant part 
of it, could be converted into liquid fuel, gas and feed 
protein, this would constitute a significant contribution to 

20 solving the problem of recycling and conservation of resources. 
However, it has been found difficult to develop an economically 
viable process of converting cellulosic material into 
fermentable sugars. 

25 The currently most promising of the suggested processes involves 
the use of enzymes which are able to degrade cellulose. These 
enzymes which are collectively known as cellulases are produced 
by a number of microorganisms, including fungi (e.g. Trichoderma 
reseei , Humicola insolens , Fusarium nvygpnrmn f etc. ) and 

30 bacteria (e.g. Clostridium thermocellum , Cellulomonas spp., 
Thermonospora spp., Bacterioides spp., Microbispora bispora . 
etc.). The economics of the production of fermentable sugars 

• from biomass by means of such enzymes is not yet competitive 
with, for instance, the production of glucose from starch by 

35 means of a-amylase due to the ineffeciency of the cellulase 
enzymes. The most significant problems connected with the use of 
cellulases is their low specific activity and the high cost of 
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their production. Therefore, there is a need to develop 
cellulases which are more efficient in degrading cellulosic 
materials into fermentable sugars. 

5 Apart from their utility for the degradation of biomass, 
cellulases have also been suggested for use in detergent 
compositions for the treatment of cotton-containing fabrics 
which largely consist of cellulose. It is well known that 
repeated washing of cotton-containing fabrics generally causes 

10 a pronounced, unpleasant harshness in the fabric due to the 
presence of amorphous regions in the cellulose fibres, which 
regions form protruding parts on the otherwise smooth fibres. 
Several methods for overcoming this problem have previously been 
suggested. For example, US 1.368.599 of Unilever Ltd. teaches 

15 the use of cellulases for reducing the harshness of cotton- 
containing fabrics. Also, US 4.435.307 (of Novo Industri A/S) 
teaches the use of a cellulytic enzyme derived from Humicola 
insolens as well as a fraction thereof as a harshness reducing 
detergent additive. Other uses of cellulases mentioned in the 

20 art include* soil removal from and colour clarification of fabric 
(cf . for instance EP 220 016) . 

Although the use of cellulase enzymes for harshness reduction of 
cotton-containing fabrics was suggested and demonstrated nearly 

25 20 years ago the mechanism of this process has not been 
elucidated and is still not known in detail. Among other things, 
this is due to the multiplicity of the enzymes and the enzyme- 
catalyzed reactions involved. As a matter of fact, cellulases 
generated in nature e.g. by microbial species are indeed complex 

30 mixtures of cellulases. Accordingly, the conversion of natural- 
ly occurring materials, like cotton, catalyzed by cellulases is 
exceedingly difficult to analyze in detail. 

Due to these circumstances , the practical exploitation of 
35 cellulases for harshness reduction and prevention as well as 
colour clarification, however desirable, has not become 
widespread and of great practical utility: it is difficult to 
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optimise production of multiple enzyme systems and thus to 
implement industrial cost-effective production of cellulase en- 
zymes, and their actual use has been hampered by difficulties 
arising from the need to employ rather large quantities of the 
5 cellulases to achieve the desired reduction and prevention of 
the harshness of cotton fabrics: for instance, addition of large 
quantities of the enzymes to detergent compositions is not com- 
patible with the optimal function of other ingredients in the 
detergent formulation nor is the addition of very large 
10 quantities of enzymes to the detergent composition in the 
interests of, e • g . , consumer safety . 

The object of the present invention is therefore to provide 
cellulase enzymes with a high specific activity. 

15 

SUMMARY OF THE INVENTION 

The present invention relates to an enzyme which exhibits 
cellulase activity, which enzyme is producible by a strain of 

20 Bacillus spp. , NCIMB 40250, or by a related Bacillus spp. 
strain, or a derivative of said cellulase. The strain NCIMB 
40250 was deposited on 18 January, 199 0, in the National 
Collection of Industrial and Marine Bacteria, 23 St Machar 
Drive, Aberdeen, Scotland, UK, in accordance with the provisions 

25 of the Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purpose of Patent Procedure. 

In the present context, the expression "enzyme exhibiting 
cellulase activity" is meant to be understood as an enzyme which 

30 is involved in the process of cellulose degradation. There are 
three different types of cellulases which act synergistically to 
produce soluble sugars: endoglucanases which show affinity for 
cellulose and which attack amorphous regions of low 
crystallinity in the cellulose fibre resulting in the formation 

35 of free ends; exoglucanases which initiate degradation from the 
non-reducing chain ends by removing cellobiose units; and B- 
glucosidases which hydrolyse cellobiose to glucose. 
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Tlie expression "related Bacillus spp. strain" is intended to 
indicate a strain belonging to the same Bacillus species as the 
strain NCIMB 40250 or a strain of a closely related species. The 
5 species to which the strain NCIMB 40250 belongs has been 
identified as Bacillus lautus . The scope of the present 
invention is also intended at least to include cellulase enzymes 
producible by other Bacillus lautus strains than NCIMB 40250. 

10 The term "derivative" is intended to indicate a protein which is 
derived from the native protein by addition of one or more amino 
acids to either or both the C- and N-terminal end of the the 
native protein, substitution of one or more amino acids at one 
or a number of different sites in the native amino acid 

15 sequence, deletion of one or more amino acids at either or both 
ends of the native protein or at one or more sites in the amino 
acid sequence, or insertion of one or more amino acids at one or 
more sites in the native amino acid sequence. 

20 Although the enzyme of the invention may be produced by 
cultivating the Bacillus spp. strain NCIMB 4 0250 or a related 
strain and isolating the enzyme from the culture, it will 
generally be more advantageous to produce the enzyme by 
recombinant DNA techniques which make it possible to optimize 

25 the yield of the enzyme produced. Furthermore, cloned genes 
encoding the enzymes may be modified in order to provide enzymes 
with improved properties. 

Thus, in another aspect, the present invention relates to a DNA 
30 construct which comprises a DNA sequence encoding an enzyme 
exhibiting cellulase activity, which enzyme is derivable from a 
strain of Bacillus spp., NCIMB 40250, or a related Bacillus spp. 
strain, or a derivative of said cellulase. The invention further 
relates to an expression vector which carries an inserted DNA 
35 construct as indicated above, as well as to a cell transformed 
with the DNA construct or with the vector. 
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In a still further aspect, the invention relates to a 
cellulolytic agent capable of degrading amorphous regions of 
cellulose fibres, the agent comprising an enzyme exhibiting 
cellulase activity as defined above. 

5 

The invention also relates to a detergent composition comprising 
the cellulolytic agent. The cellulase enzyme of the invention 
has surprisingly been found to be more stable during washing 
(for 60 minutes at 40 °C) in the presence of conventional 
10 detergents than a commercial cellulase preparation (Celluzyme™, 
a cellulase preparation isolated from Humicola insolens . 
available from Novo Nordisk, A/S) . The cause of the increased 
stability may reside in the alkalophilic nature of the enzyme 
(see example 5 below) . It is further speculated that it may also 
be ascribed to stability towards oxidation or towards the 
proteases commonly included in detergents. If so, the celluase 
enzyme of the invention may also show increased storage 
stability in liquid detergents containing proteases. 



15 



20 DETAILED DISCLOSURE OF THE INVENTION 

The cellulase enzyme of the present invention is preferably one 
which exhibits endoglucanase activity (referred to in the 
following as an endoglucanase) , in particular one which exhibits 
25 an endoglucanase activity of at least about 10, more preferably 
at least about 20, most preferably at least about 25, such as 
about 30, CMC-endoase units per mg of total protein under 
alkaline conditions. The endoglucanase activity is determined as 
the viscosity decrease of a solution of carboxymethyl cellulose 
(CMC) after incubation with the enzyme of the present invention 
under the following conditions: 



30 



A substrate solution is prepared, containing 35 g/1 CMC (Her- 
cules 7 LFD) in 0.1 M tris buffer at pH 9.0. The enzyme sample 
35 to be analyzed is dissolved in the same buffer. 

10 ml substrate solution and 0.5 ml enzyme solution are mixed 
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and transferred to a viscosimeter (e.g. Haake VT 181, NV sensor, 
181 rpm) , thennostated at 40°C 

Viscosity readings are taken as soon as possible after mixing 
5 and again 30 minutes later. The amount of enzyme that reduces 
the viscosity by one half under these conditions is defined as 
1 CMC-endoase unit. 

It should be noted that the endoglucanase of the invention is 
10 one which is active (in terms of CMC-endoase activity) under 
alkaline conditions. More specifically, the endoglucanase is one 
which has a pH optimum at a pH of about 7.5-10.5. Contrary to 
several known cellulases which are active at an acid pH and 
relatively inactive at alkaline pH values, this characteristic 
15 makes the endoglucanase of the invention particularly useful for 
washing purposes, in particular as an ingredient of a detergent 
composition, as washing of clothes is typically conducted under 
alkaline conditions due to the alkalinity of most washing 
detergents. Alkalophilic cellulases are known, e.g. from EP 271 
20 004, but they are not indicated to have a high affinity for 
cellulose, which is the case with the cellulase enzyme of the 
present invention which also exhibits a higher specific 
activity. 

25 The enzyme of the present invention is preferably one which is 
active at the temperatures at which clothes are typically 
washed, which is usually a temperature of up to about 60 ° C. 
Thus, the native enzyme isolated from strain NCIMB 40250 is 
active at temperatures between about 45 and 65°C. This, however, 

30 does not preclude the possibility that the enzyme may, under 
certain conditions, be active at temperatures above 65 °C. 

One enzyme according to the invention is an endoglucanase with 
an apparent molecular weight of 75 kD or a cleavage product 
35 thereof exhibiting endoglucanase activity. The term "cleavage 
product" is intended to indicate a shorter form of the enzyme 
resulting from, for instance, chemical or enzymatic cleavage 
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(e.g. by means of a suitable protease) after recovery of the 
enzyme or from posttranslational processing by the organism 
producing the enzyme, e.g. N- and/or C-terminal processing, 
which may give rise to a mature form of the enzyme. A specific 
5 example of a cleavage product of the -75 kD enzyme which is of 
interest for the present purpose is a product of approximately 
58 kD produced on cultivating a host organism transformed with 
DNA encoding the -75 kD enzyme. The -75 kD enzyme (and its -58 
kD cleavage product) are referred to in the following examples 
10 as Endol . 

The enzyme of the invention may be an endoglucanase encoded by 
the DNA sequence shown in the appended Sequence Listing ID#1 
(showing the sequence of the -75 kD enzyme) , or a modification 
15 thereof encoding a derivative of said endoglucanase. 

Endoglucanase derivatives may conveniently be provided by 
suitably modifying the DNA sequence coding for the native 
endoglucanase. Examples of suitable modifications of the DNA 

20 sequence are nucleotide substitutions which do not give rise to 
another amino acid sequence of the endoglucanase, but which may 
correspond to the codon usage of the host organism into which 
the DNA construct is introduced or nucleotide substitutions 
which do give rise to a different amino acid sequence and 

25 therefore, possibly, a different polypeptide structure without, 
however, impairing the properties of the endoglucanase. Other 
examples of possible modifications are insertion of one or more 
nucleotides into the sequence, addition of one or more 
nucleotides at either end of the sequence and deletion of one or 

3 0 more nucleotides at either end of or within the sequence. Such 
modifications of DNA coding for native proteins are well known 
and widely practised in the art. 

Another enzyme according td the invention is an endoglucanase 
35 with an apparent molecular weight of 56 kD or a cleavage product 
thereof (as defined above) exhibiting endoglucanase activity. In 
example 2 below, this enzyme is referred to as Endo2 . 
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The enzyme of the invention may be an endoglucanase encoded by 
the DNA sequence shown in the appended Sequence Listing ID#3 
(showing the sequence of the -56 kD enzyme) , or a modification 
5 thereof (as defined above) encoding a derivative (as defined 
above) of said endoglucanase. 

A further enzyme according to the invention is an endoglucanase 
with an apparent molecular weight of 45 kD or a cleavage product 
10 thereof (as defined above) exhibiting endoglucanase activity. A 
specific example of such a cleavage product is a protein of 
approximately 30 kD produced on cultivating a host organism 
transformed with DNA encoding the -45 kD enzyme. In example 3 
below, the -45 kD enzyme is referred to as Endo3A . 

15 

The enzyme of the invention may be an endoglucanase encoded by 
the DNA sequence shown in the appended Sequence Listing ID#5 
(showing the DNA sequence encoding the -45 kD product) , or a 
20 modification thereof (as defined above) encoding a derivative 
(as defined above) of said endoglucanase. 

Other enzymes exhibiting endoglucanase activity produced from 
endoglucanase clone 3 (cf. example 3 below) are proteins of 
25 approximately 60 and 56 kD, referred to as Endo3B and Endo3C , 
respectively . 

A still further enzyme according to the invention is an 
endoglucanase with an apparent molecular weight of 92 kD or a 

30 cleavage product thereof (as defined above) exhibiting 
endoglucanase activity. In example 4 below / this enzyme is 
referred to as Endo4. Other enzymes exhibiting endoglucanase 
activity produced from endoglucanase clone 4 (cf . example 4 
below) are proteins of approximately 74 and 71 kD. Either or 

35 both of these may be individual enzymes or cleavage products of 
the -92 kD enzyme. 
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It has been found that enzymes of the invention, e.g. Endol and 
Endo3A . are composed of a core region comprising the 
catalytically active domain and a region comprising a domain 
whose function is to mediate binding to cellulosic substrates 
5 (i.e. the cellulose— binding domain; this corresponds to a 
similar domain in an endocellulase from Bacillus subtil is 
(Nakamura et al., 1987). For example , the full-length ~75 kD 
form of Endol comprises a core region and a C-terminal 
cellulose-binding domain which, in some cases, may be cleaved 
10 off proteolytically leaving a core region of -58 kD. The 
presence of the cellulose-binding domain has been found to be 
important for obtaining a colour clarification effect in 
prewashed textiles (cf. example 6 below). 

15 Based on this finding, it may be possible to generate novel 
derivatives of cellulase enzymes which, for instance, combine 
a core region derived from an endoglucanase of the present 
invention with a cellulose-binding domain derived from another 
cellulase enzyme (e.g. one derived from a Cellulomonas f imi 

20 cellulase) . Alternatively, it may be possible to combine a core 
region derived from another cellulase enzyme with a cellulose- 
binding domain derived from an endoglucanase of the present 
invention. In a particular embodiment, the core region may be 
derived from a cellulase enzyme which does not, in nature, 

25 comprise a cellulose-binding domain, and which is C-terminally 
extended with a cellulose-binding domain derived from an 
endoglucanase of the present invention. In this way, it may be 
possible to construct cellulase enzymes with improved binding 
properties . 

30 

The DNA construct of the invention may be one which comprises a 
DNA sequence encoding any one of the enzymes described above, or 
derivatives of these enzymes as defined above. The DNA construct 
may be of genomic or cDNA origin, for instance obtained by 
35 preparing a genomic or cDNA library of an appropriate Bacillus 
spp. strain (e.g. strain NCIMB 40250 or a related strain) and 
screening for DNA sequences coding for all or part of the 
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appropriate cellulase by hybridization using synthetic 
oligonucleotide probes in accordance with standard techniques 
(cf. Maniatis et al., 1982). 

5 The expression vector of the invention carrying the inserted DNA 
construct encoding the enzyme of the invention may be any vector 
which is capable of replicating autonomously in a given host 
organism, typically a plasmid or bacteriophage. In the vector, 
the DNA sequence encoding the enzyme of the invention should be 

10 operably connected to a suitable promoter sequence. The promoter 
may be any DNA sequence which shows transcriptional activity in 
the host cell and may be derived from genes encoding proteins 
either homologous or heterologous to the host organism. The 
promoter is preferably derived from a gene encoding a protein 

15 homologous to the host organism. Examples of suitable promoters 
are lac of E.coli . daaA of Streptomvces coelicolor and amv L of 
Bacillus lichenif ormis . 

The expression vector of the invention further comprises a DNA 
20 sequence enabling the vector to replicate in the host cell. 
Examples of such sequences are the origins of replication of 
plasmids pUC19, pACYC177, pUBHO and pIJ702. 

The expression vector may further comprise a DNA sequence coding 
25 for a signal peptide in order to provide extracellular 
expression of the enzyme. The DNA sequence may, for instance, be 
the signal sequence from the a-amylase gene of lichenif ormis . 

The vector may also comprise a selectable marker, e.g. a gene 
30 whose product confers antibiotic resistance, such as ampicillin, 
chloramphenicol or tetracycline resistance, or the dal genes 
from B.subtilis or B . lichenif ormis . 

The procedures used to ligate the DNA sequences coding for the 
35 enzyme of the invention and the promoter , respectively, and to 
insert them into suitable vectors containing the information 
necessary for replication in the host cell, are well known to 
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persons skilled in the art (cf . , for instance, Maniatis et al., 
op* cit. ) . 

The host cell of the present invention may be transformed with 
the DNA construct of the invention encoding the cellulase enzyme 
described above. In this case f the DNA construct may 
conveniently be integrated in the host chromosome which may be 
an advantage as the DNA sequence coding for the cellulase is 
more likely to be stably maintained in the cell. Integration of 
the DNA constructs into the host chromosome may be performed 
according to conventional methods , e.g. by homologous 
recombination . 

Alternatively, the host cell may be transformed with an 
expression vector as described above. 

The host cell used in the process of the invention may be any 
suitable bacterium which, on cultivation, produces large amounts 
of the enzyme of the invention. Examples of suitable bacteria 
may be grampositive bacteria such as bacteria belonging to the 
genus Bacillus , e.g. Bacillus subtil is r Bacillus lichenif ormis . 
Bacillus lentus, Bacillus breyis, Bacillus stearothermoohilus - 
Bacillus alkalophilus . Bacillus amvlolicruef aciens , Bacillus 
coagulans, Bacillus circulans or Bacillus lautus . or 
gramnegative bacteria such as Escherichia col i . The 
transformation of the bacteria may for instance be effected by 
protoplast transformation or by using competent cells in a 
manner known per se . 

In a yet further aspect, the present invention relates to a 
method of producing a cellulase enzyme of the invention, which 
method comprises cultivating a host cell as described above 
under conditions conducive to the production of the cellulase or 
derivative thereof and recovering the cellulase or derivative 
thereof from the cells and/or culture medium. 

The medium used to cultivate the cells may be any conventional 
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medium suitable for growing bacteria. The cellulase may be 
recovered from the medium by conventional procedures including 
separating the cells from the medium by centrifugation or 
filtration , if necessary after disruption of the cells , 
5 precipitating the proteinaceous components of the supernatant or 
filtrate by means of a salt, e.g. ammonium sulphate, followed by 
purification by a variety of chromatographic procedures, e.g. 
ion exchange chromatography, affinity chromatography, or the 
like. 

10 

In a particular embodiment of the method of the invention, the 
cellulase is recovered in mature form, either as a result of 
posttranslational processing of a proenzyme as explained above 
or as a result of appropriate modifications of the DNA sequence 
15 encoding the enzyme in the form of deletions of DNA 
corresponding to truncations in the N- and/or C-terminal 
sequences of the enzyme. 

There is reason to believe that different cellulases may exert 

20 a synergistic effect with respect to the degradation of 
cellulose. The cellulolytic agent of the invention may therefore 
advantageously comprise a combination of two or more cellulase 
enzymes of the invention or a combination of one or more 
cellulase enzymes of the invention with one or more other 

25 enzymes exhibiting cellulase activity. Such cellulases may be 
endocellulases or exocellulases dependent on the intended use of 
the cellulolytic agent (e.g. the degree of cellulose degradation 
aimed at) . The other cellulases may be selected from those which 
may be isolated from species of Hum i col a such as Humicola 

30 insolens (e.g. strain DSM 1800), yusarium such as Fusarium 
oxvsporum (e.g. strain DSM 2672), Mvceliopthora such as 
Mvceli opthora thermoohile . Erwinia such as Erwinia 
chrvsanthermis (cf. M.H. Boyer et al., Eur. J. Biochem- 162, 
1987, pp. 311*316), Trichoderma such as Trichoderma reseei, 

35 Microb isoora such as Microbispora bispora f Neocal 1 imas t ix such 
as Neocal limastix frontalis . Piromonas such as Piromonas 
communis , Robillarda spp. , Cellulomonas such as Cellulomonas 
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fimi, Clostridium such as Clostridium thermocellum . Pseudomonas 
spp . , Thermonospora spp . , Bacterioideg spp . or Ruminococcus spp . 

The cellulolytic agent of the invention may suitably be in the 
5 form of a non-dusting granulate, stabilized liquid or protected 
enzyme. Non-dusting granulates may be produced e.g. according to 
US 4,106,991 and 4,661,452 (both to Novo Industri A/S) and may 
optionally be coated by methods known in the art. Liquid enzyme 
preparations may, for instance, be stabilized by adding a polyol 
10 such as propylene glycol, a sugar or sugar alcohol, lactic acid 
or boric acid according to established methods, other enzyme 
stabilizers are well known in the art. Protected enzymes may be 
prepared according to the method disclosed in EP 238,216. 

15 The cellulolytic agent may suitably exhibit an endoglucanase 
activity of 500-10,000 CMC-endoase units (as defined above) per 
gram of the agent. The cellulolytic agent is suitably a 
detergent additive which may comprise one or more other 
enzymes, such as a protease, lipase and/ or amylase, 

20 conventionally included in detergent additives. 

The detergent composition of the invention comprising the 
cellulolytic agent described above additionally comprises one or 
more surfactants which may be of the anionic, non- ionic, 
25 cationic, amphoteric, or zwitterionic type as well as mixtures 
of these surfactant classes. Typical examples of anionic 
surfactants are linear alkyl benzene sulfonates (IAS) , alpha 
olefin sulfonates (AOS), alcohol ethoxy sulfates (AES) and 
alkali metal salts of natural fatty acids. 

30 

The detergent composition of the invention may contain other 
detergent ingredients known in the art as e.g. builders, 
bleaching agents, bleach activators, anti-corrosion agents, 
sequestering agents, anti soil-redeposition agents, perfumes, 
35 enzyme stabilizers, etc. 

The detergent composition of the invention may be formulated in 
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any convenient form, e.g. as a powder or liquid. The enzyme may, 
if required, be stabilized in a liquid detergent by inclusion of 
enzyme stabilizers as indicated above. Usually, the pH of a 
solution of the detergent composition of the invention will be 
5 7-12 and in some instances 7.0-10.5. Other detergent enzymes 
such as proteases, lipases or amylases may be included in the 
detergent composition of the invention, either separately or in 
a combined additive as described above. 

10 The softening, soil removal and colour clarification effects 
obtainable by means of the cellulase enzyme of the invention 
generally require a concentration of the cellulase in the 
washing solution corresponding to an endoglucanase activity of 
5-200 CMC-endoase units per liter. The detergent composition 
15 of the invention is typically employed in concentrations of 0.5 
- 20 g/1 in the washing solution. Consequently, the cellulase 
concentration of the detergent composition of the invention is 
about 0.3 - 400 CMC-eridoase units per gram. In general, it is 
most convenient to add the detergent additive in amounts of 0.1 
20 - 5 % w/w * or, preferably, in amounts of 0.2 - 2 % of the 
detergent composition. For special applications, however, for 
instance when the detergent composition is to be used for colour 
clarification or harsness reduction of fabric which has been 
damaged by repeated washing, it may be convenient to include a 
much larger amount of the cellulolytic agent, such as about 20% 
w/w. 



25 



In a still further aspect, the present invention relates to a 
method of reducing the rate at which cellulose-containing 
fabrics become harsh or of reducing the harshness of cellulose- 
containing fabrics, the method comprising treating a cellulose- 
containing fabric with a cellulolytic agent as described above. 
The method of the invention may be carried out by treating 
cellulose-containing fabrics during washing. The cellulolytic 
agent may either be added as such in the amount required to 
obtained the desired effect, or it may be added as an ingredient 
of a detergent composition as described above. However, if 
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desired, treatment of the fabrics may also be carried out during 
soaking or rinsing or simply by adding the cellulolytic agent to 
water in which the fabrics are or will be immersed. 

5 The cellulolytic agent of the invention may also be employed to 
obtain colour clarification of cellulose-containing fabrics. 
After repeated washing, such fabrics often develop a grayish 
appearance. This effect is particularly evident with coloured 
fabrics, especially dark fabrics, and may probably be ascribed 

10 to undyed parts of the cellulose fibres becoming apparent when 
the cellulose fibres of which the fabric is composed are damaged 
by mechanical forces. The damaged parts of the fibres are 
assumed to be more amorphous than intact cellulose fibres and 
therefore more susceptible to the action of the cellulases of 

15 the present invention. The colour clarification effect is more 
pronounced when the cellulolytic agent contains an endoglucanase 
which comprises a cellulose-binding domain (cf. example 6 
below) • 

20 Accordingly*, the present invention further relates to a method 
of treating a coloured, cellulose-containing fabric in order to 
provide colour clarification, the method comprising treating the 
cellulose-containing fabric with a cellulolytic agent according 
to the invention. The method of the invention may be carried out 

25 by treating cellulose-containing fabrics in an aqueous medium 
during washing. The cellulolytic agent may either be added as 
such in the amount required to obtained the desired effect, or 
it may be added as an ingredient of a detergent composition as 
described above. However, if desired, treatment of the fabrics 

3 0 may also be carried out during soaking or rinsing or simply by 
adding the cellulolytic agent to water in which the fabrics are 
or will be immersed. For colour clarification purposes, the 
aqueous medium may suitably exhibit an endoglucanase activity of 
more than about 250 CMC-endoase units per liter of the aqueous 

35 medium. 

It may furthermore be possible to employ a cellulolytic agent 
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according to the invention to provide a localized variation in 
the colour of a fabric to impart a "stone-washed" appearance to 
the fabrics (for the use generally of cellulase enzymes for this 
purpose , see for instance EP 3 07 564) . 

5 

The cellulolytic agent of the invention is also contemplated to 
be useful in the field of paper pulp processing, e.g. pulp 
drainage (for the use generally of cellulase enzymes for this 
purpose, see for instance EP 262 040) , as well as for deinking 
10 of paper intended for recycling (for the use generally of 
cellulase enzymes for this purpose, see for instance JP 59-9299 
or JP 63-59494) . 

The invention is further illustrated by the following examples 
15 which are not in any way intended to limit the scope of the 
invention, with reference to the appended drawings, wherein 

Restriction maps of the Bacillus spp. DNA insert 
contained on the plasmids in the 4 endoglucanase- 
p'ositive E.coli clones. The position and direction of 
transcription of endoglucanase 1 (Endol, pPL517) and 
endoglucanase 2 (Endo2, pPL382) is indicated. Restrict 
tion enzyme sites are indicated as follows: PstI (P) , 
Hindlll (H) , Smal, (S) , Sail (Sa) , BamHI (Ba) , Balll 
(B), SphI (Sp) f EcoRT (E) . Wtm indicates pBR322 DNA. 

Restriction maps of different plasmids carrying the 
Endol gene (I j ) . The ^-lactamase promoter of pBR322 
is indicated by the arrow (Pb) and the direction of the 
transcription of the Endol gene is indicated by the 
arrow (Pb) . (mmmm) : pBR322, ( ) : "tail" of pBR322 
encoded amino acids. Restriction enzyme sites are 
indicated as follows: PstI (P) , Balll (B) , EcoRT . (E) , 
Clal (C) . The endglucanase activity in extracts of 
E.coli MC1000 containing the indicated plasmids is 
shown to the right as (A) cellulase units/ml culture 
medium. 
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Effect of temperature on the cellulase (Endol) in 
extracts of E.coli MC1000 (pPL517) and on the multiple 
cellulase activities in the supernatant of Bacillus 
see- PL236. The activity was measured at the tempera- 
ture indicated after an incubation period of 3 0 min. 
See Materials and Methods. The activity at the dif- 
ferent temperatures is presented as cellulase units/ml 
of the original culture volume. 

Relevant restriction sites and sequencing strategy of 
the endoglucanase Endol indicated as endocellulase l 

in the figure. (J > ) : extent and direction of 

sequence reactions. Abbreviations: EcoRl (E) , Pstl 
(P) , Bqlll (Bg), BstNI (Bs) , Aval (A), Clal (C) f BamHI 
(Ba). 

is a restriction map of plasmid pDN 28 01. Restriction 
enzyme sites are indicated as follows: EcoRl(E), Bglll 
TBg) , Hindlll (H) , Smal (Sm) , Sail (Sa) , SphI (Sp) , 
Pstl (P), Eagl (Ea), Clal (C) , BamHI (B) . CAT indicates 
the gene mediating chloramphenicol resistence. p 

m 

indicates the Bacillus maltogenic a-amylase promoter. 

Restriction maps of different plasmids fB. subtil is 
replication origin) carrying the Endol gene. The 
maltogenic alpha-amylase promoter is indicated by the 
arrow (P m ) , which also indicates the direction of the 
transcription of the Endol gene; (mmm) : pDN280l, 
(■) : tail of PDN2801 encoded sequences. Restriction 
enzyme sites are indicated as follows: Eagl (Ea) , Bglll 
(Bg) r Pstl (P) , BamHI (B) .The endocellulase activity 
in extracts of B.subtilis DN 1815 containing the 
indicated plasmids is shown to the right (A) as 
endoclucanase units/ml culture medium. 
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Fig. 7a Construction of pTLOS and pLA03 . (■): represents the 
and b C-terminal "tails" encoded by vector sequences. * 
indicates the expected C-terminal cleavage site. 
Restriction sites are abbreviated as follows: PstI (P) , 
5 Pvu (Pv) , BamHI (B) , Balll (Bg) , Sail (S) . 

Fig. 8a Construction of pCH57. ( \WS\ ) : Signal sequence of the 
and b alpha-amylase. ( ttl'M 1 "! ) : signal sequence of the Endol 
glucanase ( ) : "direct repeat" e.g. start of the 

10 mature Endol gene. ( • ) ribosome binding site of the 

alpha-amylase. (O) ribosome binding site of the Endol 
glucanase (M^) : alpha-amylase promoter. Restriction 
sites are abbreviated as follows: PstI (P) ; Sail (S) ; 
Konl (K) ; Eaal (Ea) r BallX (Bg) . 

15 

Fig. 9 Relevant restriction sites and sequencing strategy of 
the endoglucanase Endo2 (indicated as endocellulase 2 

in the figure). (I ^ ): extent and direction of 

sequence reactions. Abbreviations: Seal (Sc) , SacI 
20 fSa) , XmnI (X) f Hindi! (He) , Hindlll (H) , PstI (P) . 

Fig. 10 Zymogram showing the molecular weight of the active 
proteins resulting from different plasmids carrying the 
original endoglucanase 3 clone (pPL591) as well as 
25 deletions in the original insert - 

Fig. 11 Restriction maps of the endoglucanase 3 clone (in- 
dicated in the figure as endocellulase 3) (pPL59l) and 
plasmids containing deletions in the original insert. 
3 0 The position of the endoglucanase genes Endo3 A . Endo3B 

and Endo3C , on the insert in plasmid pPL591 predicted 
from the data shown in the zymogram (Fig. 10) is shown 
on the restriction map. Restriction enzyme sites are 
indicated as follows: Hindlll (H) , Smal (S) , EcoRI (E) . 

35 
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MATERIALS AND METHODS 

5 a) Bact erial strains and Dlasmids 

The donor strain, Bacillus spp. . strain PL236, was isolated from 
a compost sample from Lyngby, Denmark, on the basis of its high 
cellulolytic activity. A sample of this strain was deposited on 
18th January, 1990 in the National Collection of Industrial and 
10 Marine Bacteria, 23 St. Machar Drive, Aberdeen, Scotland, with 
the accession No. NCIMB 40250. 

The following E. coJU strains and plasmid were used: MC 1000 
( araD139) , (axa, leu.) 7697, lacX74, aalU . aalK . rpsL ^ (Casaba- 
dan et al. , 1980); CSR603 (F-, thr-1 . leuB6 . proA2 . prh-1 . 
RecAl, argE3 , thi-l, uvrAS. ara-14 . lacYl r aalK2 . xvl-5 , mtl- 
1, rpsL31 , tsx-33, -, supE44 ) (Sancar et al., 1979); PL248 is 
MC1000 containing the plasmid pNF2690 which contains the 
replication origin and the kanamycin resistance gene from 
PACYC177 (Chang and Cohen, 1978) and the cI857 repressor gene 
from the coliphage lambda; pBR322 (Bolivar, 1977) ; pUNl2l 
(Nilsson et al., 1983) pUC18 (Yanisch-Perron et al., 1985); 
PPLC28 (Remaut et al., 1981); P PL170 (Jargensen, P.L. , 1983); 

25 For the experiments with B. subtil is . the following B. subtil is 
strains and plasmids were used: DN1885 (amyE, amvR . spq* . pro* ) 
(Diderichsen, Novo Industri A/S) is a derivative of B. subtil is 
168; PL1801 is a derivative of DN1885 lacking the two main 
exoproteases (apr", npr') ; pDN2801 has the origin of replication 

30 from pUBHO (Keggins et al., 1987), the Cat gene of pC194 
(Horinouchi and Weisblum, 1982) and the maltogenic alpha- 
amylase promotor (Pm) from B.. stearothermophilus (Diderichsen 
and Christiansen, 1988) followed by a polylinker; The B. subtil- 
is/E.coii shuttle vector PJKK3-1 is described by Kreft et al. 

35 (1983) ; pPL1759 has the origin and kanamycin resistance gene of 
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pUBHO and the promoter, rlbosome binding site and signal 
sequence from the alpha-amylase gene from B. lichenif ormis 
(Stephens et al., 1984) • 

5 b) Media 

Phosphoric acid swollen cellulose (ASC) was prepared from 
chromatography cellulose (MN 3 00, Machery, Nagel) as described 
by Walseth (1952) with the exception that the cellulose powder 
10 was suspended in acetone before treatment with phosphoric acid. 
The medium for detection of cellulase activity was prepared as 
standard m) -medium (Maniatis et al., 1982) containing 0.2% ASC 
or microcrystalline cellulose (Avicel , Merck). 

15 Bacillus spp. . B.subtilis and E.coli cells were grown in NY 
medium (von Meyenburg et al., 1982) , LB medium (Maniatis et al., 
1982), or BPX medium (100 g/1 potato starch, 50 g/1 barley 
flour, 0.1 g/1 BAN 5000 SKB, 10 g/1 sodium caseinate, 20 g/1 
soybean meal; 9 g/1 Na 2 HPO A , 12H z O, 0.1 g/1 Pluronic) . 

20 

The media were solidified by the addition of agar (20 g/1) . 

Tetracycline (20/xg/ml) , or kanamycin (10 /ig/ml) were added as 
required. 

25 

c) Isolation of DNA 

To isolate the chromosomal DNA from Bacillus spp . PL236, cells 
from 250 ml overnight culture were resuspended in 10 ml (50 mM 
Tris-HCl, pH 8.0, 100 mM EDTA) , and incubated with 25 mg 

30 lysozyme for 20 min. at 37 *C. To the mixture was added 2 ml of 
10% (w/v) SDS, mixed and put on ice for 10 min. To the solution 
was then added 15 ml of phenol saturated with TE-buffer (10 mM 
Tris-HCl, pH 8.0, 1 mM EDTA), heated to 65 # C, mixed gently and 
cooled on ice. After centrifugation for 30 min. at 40000 g the 

35 aqueous phase was ether extracted, ethanol precipitated and the 



WO 91/10732 



PCT/DK91/00013 



21 

pellet was resuspended in TE-buffer. The DNA was further 
purified by banding in a CsCl density gradient (Maniatis et al., 

1982). 

5 E. coli plasmid DNA was prepared by the SDS lysis method 
(Maniatis et al., 1982); minipreparations of plasmid DNA for 
restriction enzyme analysis and transformations were prepared 
according to Holmes and Quigley (1981) . B. subtil is plasmid DNA 
was prepared by the alkaline lysis method. (Maniatis et al., 
10 1982). 

d) Cloning of chromosomal DNA from Bacillus spp. PL236 into E. 

coil 

Restriction enzymes and T4 DNA ligase were obtained from New 
15 England Biolabs and used as described by the manufacturers. 
After digestion with Fstl . EcoRI or Hindlll restriction enzyme 
the DNA was heated to 65 *C for 10 min. and ethanol precipita- 
ted. 10 fi of linearized pBR322, pJKK3-l or pUN121 and 20 g of 
fragmented fi\ s pp . chromosomal DNA were ligated with 5 units of 
20 DNA ligase in a final volume of 100 jil (16 h, 15*C) . The ligated 
DNA was used to transform competent E. coli MC1000 to 
tetracycline resistance essentially as described by Mandel and 
Higa (1970). (pBR322: 20 pg/Vil, pUN121: 7 fig/ml and pJKK3-l: 10 
jig/ml). 

25 

e) Detection of cellulase-positive E.coli clones 

The detection of cellulase activity on plates was performed 
using a modification of the technique of Teather and Wood 
(1982) . E. coli clones were grown overnight on solid NY medium at 

30 37 *C. Cells were lysed by overlaying the colonies by topagar 
containing phosphate buffer (100 mM, pH 7.0), agar (0.7%) CMC 
(0.2%), SDS (0.25 mg/ml) , and chloramphenicol (200 /i/al) and 
subsequent incubation overnight at 37 "C. Plates were then 
flooded with an aqueous solution of Congo red (1 mg/ml) for 15 

35 min. and subsequently washed with 1 M NaCl. Cellulase-positive 
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colonies were surrounded by a yellow halo on a red background. 
f) Maxicells 

Plasmid-encoded proteins were analysed using the maxicell method 
of Sancar et al. (1979) with the following modification. After 
5 UV irradiation, the surviving cells were killed by incubating 
the cells with D-cycloserine (150 ng/ml) for 48 h at 37 *c. 

cj) Gel electrophoresis 

( S)-L. methionine labelled maxicell proteins and other proteins 
10 were analysed by electrophoresis on 15% (0.075 per cent 
bisacrylamide) SDS-polyacrylamide gels (Laemmli, 1970). Proteins 
were visualized either by staining with Coomassie Blue G 250 or 
by autoradiography. 

15 Analysis of DNA was done by electrophoresis on agarose gels with 
the buffer described by Loening (1967) . 

h) Detection of ceilulase acti vity in pglvacrvlamide gels 
Detection of. ceilulase activity in protein bands separated by 
SDS-polyacrylamide gel electrophoresis was done by a modified 
zymogram technique described by Beguin (1983) . Protein prepara- 
tions were electrophoresed on a SDS-polyacrylamide gel as 
described above and the gel was washed 3 times 30 min. in 
phosphate buffer (100 mM, pH 7.0) layered on to a thin (0.8 mm) 
25 agarose gel (agarose, (1.8 per cent), CMC, (0.2 per cent), 
phosphate buffer, (100 mM, pH 7.0)), and incubated for 4 h at 
42 *C. Ceilulase activity was visualized by staining the agarose 
gel for 30 min. in an aqueous solution of Congo red (1 mg/ml) 
followed by washing the gel in 1 M NaCl. 



20 
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i) Colo r imetric ceilulase assay 

Ceilulase activity in cell extracts was analysed by measuring 
the increase in reducing groups released by the hydrolysis of 
CMC (Miller, 1959) . An appropriate amount of enzyme was in- 
35 cubated with 1.5 ml of 1 per cent CMC in phosphate buffer (100 
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mM, pH 7.0). After 30 min. of incubation at 55*0, 1.5 ml of 
dinitrosalicyclic acid reagent was added and the samples were 
boiled for 5 min. The absorbance was read at 550 nm against 
blanks containing equivalent amounts of extract from the E.coli 
5 recipient strain. One unit of cellulase released 1 nM of glucose 
equivalents per second by reference to a standard curve. 

i ) DNA-Sequencina 

Single end labelled DNA fragments were isolated and sequenced by 
10 the chemical modification method (Max am and Gilbert, 1980) . The 
cleavage products were separated on 8% or 20% poly aery 1 -amide 
gels and thereafter autoradiographed at -70 *C using intensi- 
fying screens. 

15 The dideoxyribonucleotide method of Sanger et al., (1977) was 
used for the sequencing of Endo3A using derivatives of pUC18 
(Yanisch-Perron, 1985) . 

k) Southern -analysis 

20 Chromosomal DNA from Bacillus spp. (PL236) was digested with 
restriction enzymes as required and fractionated on 1% (w/v) 
agarose gels. DNA was then blotted onto nitrocellulose filters. 
32 p -labelled DNA probes (recombinant plasmids) were prepared by 
nick translation (Rigby et al. , 1977) using 32^^^ (Amersham) 

25 and hybridization was carried out as described by Southern 
(1975) . Autoradiography was performed at -70 *C using intensi- 
fying screens. 

1^ Transformation of competent B. snh^ Uis cells 
30 A modified version of Dubnau and Davidof f-Adelson 1 s (1971) 

procedure for preparing competent cells of B. subtilis is used. 

10 ml of LB-medium is inoculated with the strain in the morning. 

7 hours later sequential dilutions in KM-l-medium are made and 

incubated overnight at 37 *C. The following morning, the second- 
35 most diluted and growing culture is diluted ten times in KM-2- 
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medium. The cells are harvested after 45-60 minutes of 
incubation by centrifuging for 3 minutes at 7K. They are 
resuspended in 1/10 volume of the supernatant and 1/50 volume of 
86% glycerol is added, o.l ml amounts are frozen on liquid 
nitrogen and stored at -80 °C. 
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In order to transform these competent cells, the method of 
Ehrlich (1986) is used, with some modifications. BTF is 
prepared and preheated to 42'C. 0.01 ml of DNA is placed in a 
reaction vessel, and the competent cells are thawed at 42'C. BTF 
is added to the cells at a ratio of 1:1, and 0.1 ml of the 
mixture is added to the DNA. The cells and DNA incubate with 
shaking for 20 minutes at 37 -C. A further 30 minutes of gene 
expression with 0.1 ml of NY-medium is needed, if kanamycin 
resistance is desired. The cells are finally spread on relevant 
plates. Recipes for stock-solutions are as follows: 

Stock solutions for making B. subtilis competent cells 



Salt mix: 



10 X MM: 



KM-stock: 



KMl-stock: 



KM2 -stock: 



35 



10 mM Caci 2 , ImM FeCl 3 and lmM MnCl P . 

20 g (NH 4 ) 2 S0 4 , 60 g KH 2 P0 4 , 140 g 
K2HP0 4 '3H 2 O in 1L. 

100 ml 10 X MM, 10 ml 10% Na- 
citrate, 2 ml 1M MgS0 4 in 1L. 

960 ml KM-stock, 2 0 ml 20% glucose, 
1 ml 20% casamino acids, 20 ml 5% 
yeast extract, 30 1 lmM MnCl 2 in 1L. 

960 ml KM-stock, 20 ml 20% glucose, 
1 ml 20% casamino acids, 20 ml 5% 
yeast extract, 1 ml salt mix, 1 ml 
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0.5 M CaCl 2 , and 0.8 ml 1M MgCl 2 in 
1L. 



BCG; 



100 ml 10 x MM, 10 ml Na-citrate, 2 
ml 1M Mgso A , 1 ml saltmix, 20 ml 20% 
glucose in 1L. 



5 



BTF : 



800 ml BCG, 100 ml lOmM EGTA, 100 mM 
MgCl 2 in 1L. 

(Prepared fresh before use) . 



10 



ISOLATION AND CHARACTERIZATION OF CELLUIiQ LYTIC STRAINS 

In order to clone genes coding for cellulose-degrading enzymes 
15 "the following screening programme was set up to find suitable 
cellulolytic donor strains. Various compost samples were used as 
source of cellulolytic microorganisms. 

Serial dilutions of compost samples were plated out on AS C agar 
20 medium and cellulolytic activity was detected through the 
formation of clearing zones around the colonies. Several 
cellulolytic bacteria were isolated. One of the most active of 
these isolates which was identified as Bacillus spp. PL23 6 
(NCIMB 40250) was selected as the donor strain for the cloning 
25 experiments. 

The strain rapidly degraded both acid swollen cellulose and 
microcrystalline cellulose (Avicel, Merck) on agar medium. At 
the optimum temperature of growth (42 # C) the clearing zones 
30 appeared in 2-3 days. 

Adding small amounts of Bacillus spp . PL236 culture to suspen- 
sions of microcrystalline cellulose makes the cellulose crystals 
lump together tightly and sediment. 



35 
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This suggests that either the Bacillus spp. PL236 cells or the 
extracellular cellulase enzymes have a strong affinity for the 
cellulose substrate and tightly bind the cellulose crystals 
together . 

5 

SCREENING ASSAY FOR RECOMBINANT CLONES 

The first attempts to clone cellulases from Bacillus spd . PL236 
were directed towards the endocellulases genes from Bacillus 
10 see. PI.236. To facilitate the screening on plates of endocellu- 
lase positive clones, an assay using the dye Congo red (Teather 
and Wood, 1982) was adapted to E.coli. 

In order to detect cellulase activity trapped inside the 
15 recombinant E.coli cells the cells were lysed by adding SDS to 
the top agar. This modification of the top agar had no measure- 
able effect on the CMC-degrading enzymes of Bacillus spE-PL23 6 f 
when the modified assay was used on this organism. 

20 MOT.^PTTT AR CLONING OF ENDOCELLULASES 

Several endocellulase genes were cloned from the cellulolytic 
Bacillus spp. strain (PL236) . An endocellulase 1 clone (PL517) 
was made by ligating PstI partially digested PL23 6 chromosomal 
25 DNA with PstI cleaved pBR322 and subsequently transforming 
competent E. coli cells. 

An endocellulase 2 clone (pPL382) was made by ligating Hindlll 
partially digested PL236 chromosomal DNA with Hindlll cleaved 
30 pJKK3-l (an E. col i/B. subtil is shuttle-vector) and subsequently 
transforming competent E. coli cells. 



35 



An endocellulase 3 clone (pPL591) was made by ligating EcoRl 
partially digested PL236 DNA with EcoRl cleaved pUN121 and 
subsequently transforming competent E. coli cells. 
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An endocellulase 4 clone (pPL592) was made by ligating Hindlll 
partially digested PL236 DNA with Hindlll cleaved pUN121. and 
subsequently transforming competent E.coli cells. 

5 

Trans formants derived from the use of both the pBR322 and the 
PUN121 plasmid vector were screened by their tetracycline 
resistance (pBR322: 20 g/ml, pUN121: 7 tig/ml and pJKK3-l: 10 
Mg/ml). 

10 

The transfonnants were replicated to another set of plates and 
overlayed by the modified CMC top agar. The plates were in- 
cubated overnight and stained with Congo red as described above. 

15 Under the reisolation procedure it was observed that it was 
possible to detect positive clones without adding the cell 
lyzing agent SDS to the topagar, although the diameter of the 
halo was considerably smaller. 

20 EXPRESSION AND CHARACTERIZATION OF THE CLONED CELLUIASES 

Plasmids from the cellulase-positive transfonnants were isolated 
and analyzed with restriction enzymes. Restriction enzyme maps 
of representative plasmids from all four cloning experiments are 
25 shown in Fig. 1. 

To determine the molecular weight of the cloned endoglucanases 
a zymogram technique (Beguin, 1983) was used. Total protein 
preparations from representative endoglucanase clones were 

30 separated on a SDS-polyacrylamide gel. The proteins in the gel 
were then renatured by washing out the SDS and replicated onto 
an agarose gel containing CMC. Renatured proteins diffuse to 
this activity gel and proteins representing endoglucanase 
activity hydrolyse the CMC in the gel. The endoglucanase bands 

35 were then visualised by staining the activity gel with Congo red 
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as described above. 

As appears from Fig. 1, the restriction maps of the cloned DNA 
(as well as the molecular weight of the endoglucanases encoded 
5 by the cloned DNA) are different in the four different clones. 
This indicates that these four clones represent at least four 
different endocellulase genes from Bacillus spp . (PL23 6) . 

The detailed analysis of the different endocellulases represen- 
10 ted by these four clones is described in the following. 

Example 1 

Endocellulase 1 (Endol) 

Physical mapping of endocellulase gene 1 

15 

In the cloning experiment described above several cellulase- 
positive E.coli clones were obtained, which contained different 
fragments of PstI partially cleaved Bacillus spp . PL23 6 DNA. 
Fig. 2 shows the restriction maps of some of these clones. The 

20 plasmids invariably contained two PstI fragments (1000 and 1350 
bp.) indicating that both were necessary for synthesis of a 
polypeptide with cellulase activity: Nucleotide sequencing has 
later shown that the 500 bp. PstI fragment present the on pPL217 
and pPL517 (Fig. 2) contains the C-terminal part of the complete 

25 cellulase gene (data are presented in the following) . The 
structure of this part of the B . spp . chromosome was confirmed by 
Southern analysis (data are presented in the following) . 

Activity measurements 

30 

Extracts from the E. coli clones containing pPL212, pPL2l6, 
PPL517 and pBR322 were prepared from overnight cultures grown in 
NY medium supplemented with tetracycline. Cells were 
concentrated 10-fold in 100 mM phosphate buffer, pH 7.0. DNase 
35 was added and the cells were ruptured by passing them twice 
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through a French Press (12000 lb/ in) . The extracts were 
centrifuged for 60 min. at 40000 X g and the supematants were 
assayed for cellulase activity. The cell-free extract of E.coli 
MC1000(pPL212) , E.coli MC1O00 (pPL517) and E. coli MC1000 (pPL216) 
contained 8 units, 7.2 units and 0.2 units respectively of 
cellulase/ml of original culture volume. (Fig. 2) . 1/7 of the 
total cellulase activity in these cultures was found in the 
supernatant. The E.coli MC1000(pBR322) clone showed no cellulase 
activity. 

The high level of cellulase activity in extracts of strains 
carrying the plasmids pPL212 and P PL517 is most likely due to an 
increased transcription of the cellulase gene originating from 
the ^-lactamase promoter on pBR322. It was concluded that the 
cellulase is expressed in the direction shown in Fig. 2. 

Measurements of the viscosity of a CMC solution and of the 
release of reducing sugars indicated that the cloned cellulase 
is an endo-(l,4)-/9-glucanase. 

Maxicell an d zymogram analysis 

The molecular weight of the endoglucanase protein was analysed 
by the maxicell technique. The plasmids pPL212, pPL216 and 
PBR322 were transformed into the maxicell strain CSR603 (Sancar 
et al., 1979) to analyse for plasmid-encoded proteins. The 
plasmid pPL212 gave rise to three polypeptides of 75000 D, 65000 
D and 58000 D in addition to the proteins encoded by pBR322. 
Apart from the pBR322 proteins, no proteins encoded by the 
plasmid pPL216 could be detected, due to the low expression of 
the cellulase gene. The three polypeptides from pPL212 were 
tested for cellulase activity using a gel replica technique, 
(Beguin, 1983). 

A comparison of the cellulase activity bands and the bands on 
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the autoradiogram from the polyacrylamide gel showed that the 
75000 D and at least one of the 58000 and 65000 D protein bands 
had cellulase activity. It was also found the supernatant of the 
Bacillus see- PL23 6 culture contained at least three different 
5 proteins with cellulase activity. One of these proteins comigra- 
ted with the 58000 D protein synthesized in the maxicell. Cell 
extracts from E.coli MC1000 (pPI.212) only revealed one active 
band comigrating with the 58000 protein from the maxicell E. coli 
CSR603 (pPL212) . As mentioned previously, the following sequence 

10 data showed that the plasmids pPL212 and pPL216 did not contain 
the entire Endol endoglucanase gene. The 75000 D active protein 
seen in the maxicell E. coli CSR603 (pPL212) is thus a fusion 
protein, where 105 C-terminal amino acids are encoded by pBR3 22 
sequences (Fig. 2) . This fusion protein is apparently posttrans- 

15 lationally processed, ending up with the 58000 D mature endoglu- 
canase. Cell extracts from E. coli MC1000 (pPL217) and E.coli 
MC1000 (pPL517) , which contain the complete endoglucanase gene, 
also gave activity bands of M r approx 58000 D and 75000 D. The 
75000 D active, protein from the plasmids pPL217 and pPL517, thus 

20 represents the "genuine" initial translation product from the 
Endol glucanase gene, which apparently is processed more slowly 
than the fusion protein from pPL212. However, both the 75000 D 
fusion protein synthesized from pPL212 and the "genuine" 75000 
D protein synthesized from pPL517 and pPL217 are processed down 

25 to a 58000 D protein with high cellulase activity. 

From the sequence data it can also be predicted that the 
endoglucanase expressed from pPL216 is synthesized a fusion 
protein, where the 26 C-terminal amino acids are encoded by 

3 0 pBR322 sequences (Fig. 2) . This protein was not detected by the 
maxicell technique and zymograms using extract from PL216 only 
revealed one active band of 58 000 D, which most likely repre- 
sents the processed protein. The processing in E. coli of the two 
fusion proteins from pPL212 and pPL216, which represent two 

3 5 different lengths of the C-terminal "tail" thus results, in both 
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case, in an active protein of approx. 58000 D . This indicates 
that the endocellulase is processed from the C-terminal, because 
N-terminal processing would result in two proteins with a 
difference in M r of approx. 9000 D, which would easily have been 
detected on the zymograms. 

It is therefore most likely that endoglucanase l (Endol) is 
synthesized as a proenzyme which at least in E. coli (and 
possibly also in B. spp . ) is modified by stepwise removal of 
approx. 150 C-terminal amino acid residues and approx. 3 0 N- 
terminal amino acid residues, corresponding to the removal of 
the signal peptide. The endoglucanase seems to be modified 
correctly as indicated by the fact that the final processing, 
product, the 58000 D activity band present in E. coli MCiooo 
15 (pPL2l2) extract, apparently comigrate with one of the endoglu- 
canases present in the supernatant of cultures of Bacillus spp . 
PL236. 



10 



20 



Temper ature optimum and stability 



The cellulase activity of the extracts was measured at dif- 
ferent temperatures and the highest activity of Endol produced 
in E.coli was found at 60'C (Fig. 3). The heat stability of the 
endoglucanase was tested by incubating the extracts at sec, 

25 55 *c and 60 'C for varying periods and the residual activity was 
measured as outlined in Materials and Methods. Although the 
highest activity was observed at 60 *C with a fixed incubation 
time of 30 min. , the enzyme is inactivated at this temperature 
with atjof 1.2 h. At 50'C and 55'C no inactivation was observed 

30 after 5 h of incubation. 

DNA-seauenee 

The nucleotide sequence of the endocellulase gene 1 (Endol) was 
35 deduced from the plasmid pPL5l7 which contains approx. 2850 bp 



REFlACEMENTSHEET 



WO 91/10732 



PCT/DK91/00013 



32 

of Bacillus spp . PL236 DNA. 

The sequence wasa determined by the chemical modification method 
(Maxam and Gilbert, 1980) using the partial restriction map and 
the sequencing strategy outlined in Fig. 4. 

The complete nucleotide sequence is shown in Sequence listing 
ID#l. A computer analysis of this seguence revealed only one 
open reading frame long enough to encode the approx. 7500 0 D 
protein detected by the maxicell and zymogram analysis of 
extracts from the cellulase-positive E. coli MC1000 (pPL5l7) . This 
seguence which begins at nucleotide 677 and ends at nucleotide 
2776, encodes an enzyme of 700 amino acids. The M r calculated 
from the DNA sequence was 77006 D. 

Within the open reading frame there were three potential initia- 
tion codons (ATG at positions 677, 737 and 749), but only the 
ATG codon at position 677 was preceeded by a ribosome binding 
site (AAGGAGGX (Mclaughlin et al., 1981). Jt was therefore 
concluded that the ATG codon at position 677 was the correct 
initiation codon. 



The initiation codon is followed by an amino acid sequence which 
resembles signal sequences found in gram-positive organisms. 
Such sequences consist of a relatively short hydrophilic region 
at the N-terminal followed by a longer sequence of hydrophobic 
residues. 



By using the signal sequence cleavage model proposed by Heijne 
(1983) the cleavage site can be predicted to be between the two 
first alanine residues in the sequence Asn-Ala-Ala-Ala. The 
signal seguence is thus 31 amino acids long. 

The upstream and downstream regions contained no significant 
homology to the consensus sequence of the sigma 43 promoter of 
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B. subtil is and no terminator ^1 ike sequences. 
Southern analysis 

The Bacillus see. (PL236) chomosomal DNA was digested with 
HindHI , Pstl, EcoRl and Xhol and plasmids pPL212 and pPL509 
were used as probes for the hybridization. Plasmid pPL212 
contains two Pstl fragments (1350 bp. and 1000 bp.) and plasmid 
PPL509 contains only the 500 bp. Pstl fragment of the entire 
Endol gene, represented by the plasmid pPL517, which contains 
three Pstl fragments (1350 bp., 1000 bp. and 500 bp.) of 
Bacillus see. PL236 DNA. The pPL212 probe recognized the 
expected two Ps£l fragments (1350 bp., 1000 bp.) and the pPL509 . 
the 500 bp. Pstl fragment in the Bacillus spp . PL236 Pstl 
digest. Both the pPL212 and the pPL509 probe also recognized the 
same overlapping £coRl fragment and the same overlapping Hindlll 
fragment in Bacillus see. PL23 6, EcoRT and Hindlll digest. These 
results indicate that the Bacillus spp. PL236 DNA insert in 
PPL517 was cloned in a non-deleted form and that the three Pstl 
fragments in pPL517 are continuous on the Bacillus spp . PL23 6 
chromosome. 

Expression of the Endol gene in R-gntvtilis 

For the cloning experiments in B. subtil is . pPL517 was used as 
the donor of the EndPl gene and pDN2801, carrying a strong 
Bacillus promoter P m , was used as the Bacillus vector (Fig. 5) . 

The Endol gene-containing Eaal fragment was ligated to EaoT 
cleaved pDN2801 and by subsequent transformation to compentent 
B.subtilis cells (DN1B85) , strain CH7 was obtained. To test 
whether the processed C-terminal part was necessary for the 
expression of the Endol gene in B. subtilis cells, a construc- 
tion was made where the Endol gene was fused to vector sequen- 
ces in the internal BglXI site. This fusion replaces the coding 
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region for the C-terminal 94 amino acids with 55 "random" amino 
acids encoded by vector sequences. 

Similar constructions made in E. coli vectors, though fused to 
different vector sequences, resulted in an active periplasmic 
endoglucanase in E. coli . which was processed in the "correct" 
manner. Part of the Endol gene contained in the Balll fragment 
from pPL517 was ligated with BamHI cleaved pDN2801 and sub- 
sequent transformation to competent B. subtil is cells (DN1885) 
resulted in strain CH14 . 

Transformants were in both cases screened for their chloramphe- 
nicol resistance, and the desired plasmid constructions in the 
strains CH7 and CH14 by restriction analysis of their plasmids. 
The two versions of the Endol gene on the plasmids pCH7 and 
PCH14 are thus transcribed from the same promoter P m . a 
restriction map of the plasmids is shown in Fig. 6. 

The B.subtiljs, DN1885 used for these experiments produces an 
endoglucanase of its own, which of course gave some background 
activity . The Endol gene product was exported to the culture 
medium from the recombinant strain CH7, and the activity 
measured in the culture supernatant was approximately 20 times 
higher than the background activity (Fig. 6) . No extracellular 
activity above the background level was detected from the 
recombinant strain CH14 which contains the Endol gene with the 
substituted C-terminal. 



The culture supernatants from the strains CH7 and CH14 and cell 
extract from CH7 cells were analysed by the zymogram technique. 
The zymogram revealed active protein bands of approx. 75000 D 
and 58000 D from the CH7 cell extract and only one active 
protein band of approx. 58000. D from the CH7 culture superna- 
tant. These bands correspond to those observed in E. coli and the 
processing of approx. 90 amino acids from the C-terminal appears 
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to take place in B . subtilis too. 

The plasmid pCH7 was transformed to PL1801, which is a der- 
ivative of DN1885 lacking the two main exoproteases (apr",npr~ 
5 ) , resulting in the strain CH14. The Endol cellulase as produced 
from CH14 was processed "normally" indicating that the two main 
exoproteases from B. subtilis are not responsible for the C- 
terminal processing of the Endol cellulase. 

10 A very weak active band of approx. 58000 D was detected from the 
CH14 culture supernatant, indicating that the manipulated gene 
is expressed and processed in at least almost the same way as 
the native gene product. Among other things, the very low 
expression from pCH14 and the fact that the two genes are 

15 expressed from the same expression signals may indicate that the 
approx. 90 C- terminal amino acids are necessary for the export 
of the Endol gene product from B. subtilis . 

Optimization of expre ssion of the Endol gene in E . coli 

20 

In order to optimize the expression of Endol . the Endol gene was 
combined with the strong E.coli promoters P R and P L originating 
from phage lambda (Remaut et al., 1981). Both promoters are 
repressible by the lambda CI857 repressor, which is heat labile, 
25 thus rendering the P R and P L promoters heat inducible, in cells 
producing the lambda CI857 gene product. (Ptashne et al. , 1982) . 

The P R promoter is contained on the expression plasmid pPLl7 0 
together with the lambda cI857 gene. (Fig. 7; Jorgensen, 1983). 

. 30 

The P R promoter was placed upstream of the Endol gene by ligating 
the P R containing Pvul - Sail fragment from pPL170, to the Endol 
gene containing Pvul - Sail fragment. Transformation to 
competent MC1000 cells resulted in the strain TL05 containing 
35 the plasmid pTL05. In the plasmid pTL05 the ^-lactamase promoter 
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is deleted, thus bringing the Endol gene under transcriptional 
control of the P R promoter. At this point, the Endol gene was 
believed to be contained within the Ball! fragment from pPL2129. 
The Endol gene fusion to vector sequences on pPL212 was 
therefore transferred to pTL05, resulting in a fusion protein 
where 105 C-terminal amino acids are encoded by vector 
sequences. This fusion protein is however processed correctly as 
shown earlier with the strain PL212. The cellulase production 
from TL05 is completely repressed at 28 *C and induced at 42 °C. 

The P L promoter provided on the plasmid pPLc28 (Remaut et al., 
1981) was combined with the Endol gene by ligating the Ball I 
fragment from pPL212 to BamHI cleaved pPLc28. Transforming to 
competent PL248 cells, which are harbouring the lambda cI857 
gene on a compatible pACYC177 based plasmid (pNF2690) , resulted 
in the strain IA03 , containing the plasmid pLA03 (Fig. 7). The 
Endo1 gene is thus fused to vector sequences, but due to 
unspecified DNA sequences in pPLc28 the length and nature of the 
resulting fusion protein is unknown. The cellulase production 
from IA03 was completely repressed at 28 'C and induced at 
40.5°C. 

The cellulase production from LA03 and TL05 was evaluated at 
different temperatures . 

IA03, TL05 and PL212 were grown overnight at 28 *C in NY medium 
supplemented with the appropriate antibiotics (AMP + KAN, KAN 
and TET, respectively) . For each strain the overnight cultures 
were diluted 100 fold in NY medium (AMP + KAN, KAN and TET, 
respectively) , and the diluted cultures were grown at different 
temperatures between 28 °C and 42 °C. Cells from each culture were 
harvested at OD 450 - 1 and lysed on a French Press, and the 
activity in the extracts was determined as described earlier. 

LA03 which exhibited the highest cellulase production was unable 
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to grow at temperatures above 4 0.5*C. The experiment was 
repeated without antibiotic selection pressure in the diluted 
cultures. Similar results were obtained, but 1A03 grew very 

. slowly at temperatures above 40.5°C. However, this growth was 

5 f ollowd by a significant loss of the plasmid pIA03 . No signif i- 

_ cant loss of pI_A03 at temperatures up to 4 0.5*C or pPL212 and 

pTL05 at any temperature, was observed. 

Optimization of expre ssion of the Endol gene in B. subtil is 

10 

In order to optimize the expression of the Endol gene in 
B. subtilis, the Endol gene was fused to the expression-signals 
(promoter, ribosome binding site and signal sequence) from the 
alpha-amylase gene from B. lichenif ormis , which is expressed in 
15 high amounts in B. subtilis . 

PPL1759 contains the promoter, ribosome binding site and most of 
the signal peptide of the B. lichenif ormis alpha-amylase 
(Stephens et a_l., 1984) . The downstream side of this region ends 

20 with a PstI site, which again is followed by a poly linker (Fig. 
8). Between the PstI and the Sail site in the polylinker of 
PPL1759, a synthetic DNA fragment consisting of two comple- 
menting oligonucleotides creating PstI and Sail "sticky" ends 
was inserted. In the resulting plasmid pCH52 the synthetic 

25 linker reconstitutes the missing part of the signal peptide of 
the alpha-amylase and further encodes the first 14 N-terminal 
amino acids of the mature Endol cellulase (Fig. 8) . The linker 
thus creates a hybrid signal peptide cleavage site between the 
alpha-amylase and the cellulase. The expected cleavage site is 

3 0 shown in Fig. 8. From pPL517 the Endol gene was excised without 
promoter on an Eagl fragment and inserted into the unique Eaal 
^ site in pCH52. The plasmid in which the Endol gene was inserted 

in the correct orientation was named pCH54. 

35 pCH54 contains two direct repeated sequences of 45 bp (e.g. the 
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45 N-terminal base pairs of the mature Endol gene) which may 
recombine, deleting the region between them (Ehrlich et al.; 
198 6) . This recombination event, however, occurs with a very low 
frequency when the repeat is as small as 45 bp. In order to 
5 enrich the amount of plasmid that has recombined, a plasmid 
preparation of pCH54 was cut with the enrichment restriction 
enzyme KpnI. pCH54 contains a unique Kr>nl site between the two 
direct repeats and only non-recombinant plasmids are cut with 
KpnI, while recombinant plasmids stay circular. When B. subtilis 
10 (DN 1885) was retransfonned with this mixture, transformants 
were mostly (90%) containing recombinant plasmids, since 
B. subtjljs competent cells are not transformed with linerized 
plasmid DNA. The recombinant plasmid was called pCH57 and is 
contained in the strain CH57 . The structure was confirmed by 
restriction analysis, but the gene fusion was not confirmed by 
DNA sequencing. In this construction pCH57 the Endol gene is 
thus perfectly fused to the alpha-amylase expression signals. 



15 



20 



25 



The endoglucanase is produced extracellularly from the b. 
subtilis strain CH57, indicating that the hybrid signal cleavage 
site is functioning. The secreted Endol endoglucanase 
processed to the expected M r , namely 58000 D. 



is 



The production of the Endol cellulase from CH57 was evaluated in 
two different media, NY (overnight at 37 *C) and BPX (7 days at 
37 *c). The BPX medium is a very rich medium in which the 
nutrients are slowly released, thus keeping the cells in an 
early stationary fase for several days during fermentation. The 
alpha-amylase expression signals function particularly well in 
3 0 this medium. The results appear from Table 1 below. 
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Table 1 



STRAIN 


PLASMID 


U/ml 


U/ml 






NY 


BPX 


CH7 


pCH7 


6.5 


55. 0 


CH57 


pCH57 


13.0 


325. 0 


DN1885 




0.3 


40. 0 



15 Analysis of the culture supernatant (BPX-medium) on PAGE 
revealed a dominant (90%) endoglucanase band corresponding to a 
concentration of endoglucanase in the supernatant of appro*. 0.5 
g/L. 

20 Example 2 

Endoce] lulase 2 (Endo_2) 
DNA-seq uenoe 



The nucleotide sequence of endocellulase 2 (Endo2) was deduced 
from the plasmid P PL382 which is described above. The plasmid 
contains approx. 2500 bp. of Bacillus spp . PL23 6 DNA. The 
sequence was determined by the chemical modification method 
(Maxam and Gilbert, 1980) using the partial restriction map and 
the sequencing strategy outlined in Fig. 9. 

The complete nucleotide sequence is shown in Sequence Listing 
ID#3. A computer analysis of this sequence revealed only one 
open reading frame long enough to encode for the approx. 56000 
D protein detected in the zymogram analysis of the extract from 
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the cellulase positive clone E.coli MC1000 (pPK382 ) . This 
sequence begins at position 172 and ends at 1869 and encodes an 
enzyme consisting of 566 amino acids. The calculated M r is 62551 
D which is slightly higher than the M r of 56000 D determined by ? 
5 zymogram analysis. This difference could be due to inaccuracy in 

the zymogram analysis or to post-translational processing beyond £ 
the expected processing of the signal peptide. The ATG 
initiation codon in position 172 was selected because it was the 
only initiation codon within the open reading frame, which was 
10 proceeded by a ribosome binding site AAGGAGG (Mclaughlin et al . , 
1981) . 

This initiation codon was followed by a signal sequence-like . 
sequence, and by use of the signal sequence cleavage model 
15 proposed by Heijne (1983), the cleavage site could be predicted 
to be between the two alanine residues in the middle of the 
sequence Leu-Ala-Ala-Ala. The signal sequence of the Endol2 is 
thus 3 0 amino acids long. 

20 The region upstream of the open reading frame contained a 
sequence homologous with the sigma 43 type promoters of B . subt- 
ilis (Johnson et al., 1983) at position 46-75. 

This sequence consists of TTTACA as the -35 region and TATTAT as 
25 the -10 region; the two are separated by 18 nucleotides. 

A palindromic repeat sequence of 13 bp. was found downstream of 
the termination codon at position 1956-1981, which seems to 
resemble a rho-independent terminator (Rosenberg and Court, 
30 1979) . j 

Southern analysis * 

The Bacillus sdp . (PL236) chromosomal DNA was digested with 
3 5 HindTII . Pstl ■ EcoRI and Xhol and the plasmid pPI382 was used as 
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probe for the hybridization. The hybridization pattern 
obtained confirmed that the Bacillus s pp . PL23 6 DNA was cloned 
in non-deleted form, that the two Hindi! I fragments from pPL382 
was continuous on the Bacillus spo.. PL 236 chromosome and that 
the Endo2 gene was different from the other cloned endoglucana- 
ses . 



Express ion of Endo2 in B. subtil -ic 

10 Plasmid pPL382 was transformed to B.subtilis DN1885 to achieve 
secretion of the mature Endo2 product. B. subtil is DN1885 
(PPL382) was grown aerobically in 64 0 ml LB-medium containing 10 
Mg/ml tetracycline for 30 hours. The supernatant was 
concentrated by precipitation for 24 hours with (NH A ) 2 S0 4 at 7 0% 
15 saturation. After 5 hours of dialysis against 100 mM Tris-HCl pH 
7, the concentrated supernatant was heated to 55 *c for 15 
minutes. Denatured protein was removed by centrifugation and the 
soluble proteins were subsequently precipitated with (NH<) 2 S0 4 at 
70% saturation. The resolubilized proteins were dialyzed against 
20 ioo mM Tris-HCl pH 7 with a final volume of 2 ml and were 
applied to a 80 cm x 1 cm gel filtration column containing 
Ultrogel AcA 44 (LKB) . Active fractions were pooled, 
concentrated and applied to a SDS containing polyacrylamide gel,' 
where the endoglucanase appeared as a single band at 56 kDa. The 
activity yield of the method was approximately 10%. 

The endoglucanase comigrates with the endoglucanase obtained 
from extracts of mciooo (pPL3 82) as detected by Zymogram 
analysis in the supernatant of DN1885 (pPL382) . The endogluca- 
30 nase activity of DN1885 (pPL382) is about 25 times that of 
strain DN1885 when grown in NY medium. 



25 
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EXAMPLE 3 

Endocellulase 3 

5 Physical mapping of the endocellulase clone 3 

The endocellulase clone 3 is represented by the plasmid pPL591 
which contains a 11000 bp. EcoRI fragment of Bacillus spp . PL236 
DNA. A partial restriction map of this plasmid is shown in Fig. 
10 10. Zymogram analysis of extracts from E.coli MC1000 (pPL591) 
shows that the insert gives rise to four proteins with cellulase 
activity .* The approx. M r of these proteins were 60000 D, 56000 
D, 45000 D and 30000 D (Fig. 10) • 

15 Southern analysis showed that the EcoRI fragment from pPL591 was 
cloned in a non-deleted form from the Bacillus spp , PL23 6 
chromosome, and indicates that the DNA did not contain the DNA- 
seguences encoding the Endol and Endo2 genes. 

20 To analyse whether these proteins represented post-trans- 
lational processing products from one or several cellulase 
genes, deletion plasmids were made using the restriction enzymes 
HindllJ and Smal. Deletion of the 4 BOO bp. Smal fragment, 
resulting in the plasmid pPL538, did not eliminate any of the 

25 four cellulase bands on the zymogram. The B. spp . DNA insert on 
pPL591 contained 5 Hindlll sites. Digestion of pPL591 with 
HindXII thus gave 6 fragments of 6600 bp. (vector fragment) , 
3200 bp., 1700 bp., 1550 bp., 1350 bp. and 900 bp., where the 
1700 bp. fragment had originated from the Hindlll site within 

30 the pUN121 vector plasmid (Fig. 11). 

Elimination of all of the Hindlll fragments except the 1700 bp. 
fragment (reinserted in the opposite direction) resulted in the 
plasmid pPL54 0. Removal of the rest of the Bacillus spp . PL236 
3 5 DNA by eliminating the EcoRI fragment from pPL54 0 resulted in 
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the plasmid pPL587. Both the E. coli MC1000 (pPL540) and the 
E.coli MC1000(pPL587) were cellulase-positive and zymogram 
analysis of extracts from these clones revealed only the 4 5000 
D and 3 0000 D proteins. The protein-coding capacity of the 
Bacillus see- PL236 DNA (approx. 1500 bp.) is approx. 55000 D, 
which is too small to contain two endoclucanase genes of 45000 
D and 3 0000 D. The 3 0000 D protein on the zymogram is thus most 
likely a result of immature post-translational processing of the 
45000 D protein. The cellulase gene encoding the 45000 D protein 
was designated Endo3A . The Endo3A gene was cloned in both direc- 
tions on the plasmids pPL587 and pPL538 giving rise to the same 
two proteins, thus eliminating the chance of the protein being 
a fusion protein. 

Elimination of the 1550 bp., the 1350 bp. and the 900 bp. 
fragment resulted in the plasmid pPL542 . Extracts from E. coli 
MC1000(pPL542) revealed cellulase positive proteins of approx. 
30000 D, 45000 D, 49000 D and 56000 D (Fig. 10) . From these 
preliminary results, the existence of two additional endogluca- 
nase genes within the original insert on pPL591 and pPL538 are 
postulated. The additional endoglucanase genes are designated 
Endo3B and Endo3C. Their postulated position on the Bacillus 
SEE. PL23 6 DNA is shown in Fig. 11. The postulated model is 
based on the assumption that the 60000 D protein made from 
PPL538 and pPL591 is converted to a truncated fusion protein of 
49000 D made from pPL542 where the Hindlll fragment of 1350 bp. 
is deleted. 

DNA-secruence of Endo3A 

The DNA sequence of the Endo3A gene was deduced from the plasmid 
PPL540 containing approx. 1500 bp. of Bacillus spp . PL23 6 DNA 
using the dideoxy chain termination method. The gene was placed 
in pUC18 in both orientations, and a number of deletions were 
constructed. Standard primers were used except for one synthe- 
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tic oligonucleotide that was used for sequencing a region with 
no practical restriction sites. 

The C-terminal part of the gene was deduced from the plasmid 
5 pPL538. The partial DNA-sequence is shown in Sequence Listing 
ID#6. The sequence revealed an open reading frame coding for a 
protein with a M r of about 62000 D which is in agreement with the 
observed protein of 60JcD in the zymograms. The ATG start codon 
(position 3 0) is preceded by a typical ribosome binding site 
10 (Mclaughlin et al., 1981). The initiation codon is followed by 
a typical gram-positive signal sequence and by using the signal 
sequence cleavage model (Heijne, 1983) a signal sequence of 36 
amino acids is revealed. 

15 EXAMPLE 4 

Endocellulase 4 

Zymogram analysis 

20 

The endocellulase clone No. 4 is represented by the plasmid 
pPL592 f which contains approx. 14000 bp. of Bacillus sr>p . PL236 
DNA. A partial restriction map is shown in Fig. 1. 

25 A zymogram analysis of extracts from E. coli MC1000 (pPL592= 
revealed three cellulase active proteins with M r values of 
approx. 92000 D, 74000 D and 71000 D. Further analysis is 
necessary to determine whether these proteins are encoded by one 
or several cellulase genes. 



30 



35 



Southern analysis confirmed the origin of the cloned DNA on the 
Bacillus 5DD . PL23 6 chromosome, and indicates that the cloned 
DNA is not represented on the other endoglucanase clones. 
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EXAMPLE 5 

A. Endol cloned a nd expressed in Bacillus subtil is 

5 An agar slant was inoculated with subtilis strain CH 57 and 
incubated for 2 0 hours at 37 °C. 10 ml of a 0.9 % NaCl-solution 
was added to the test tube which was shaken to suspend the 
cells. The cell suspension was used to inoculate a 2 1 
f ermentor . 

10 

The following parameters were used to run the fermentation: 
Temperature : 3 7 ° C . 
15 Aeration: 1,1 1/minute. 
Stirring: 1100 rpm. 

Permentor: A 2 1 model with a working volume of 1,5 1. 

20 

The pH was maintained between 6.2 and 7.2 for the first 40 hours 
of fermentation. After that the pH was maintained between 6.7 
and 7.2. The pH was maintained within this range by dosing with 
NH 3 and H 3 P0 4 . 

25 

Dosing of a glucose solution was initiated after 40 hours at a 
flow rate of 3.7 ml/hour. 
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Substrate 



Potato starch degraded with Termamyl* 50 g 
Soybean meal 110 g 

5 Corn steep Liquor 16.5 g 

Alburex (potato protein) 27.5 g 
(NHJ 2 S04 2.2 g 

KH 2 P0 4 1,2 g 

Na 2 HP0 4 .2H 2 0 5,9 g 

10 Water added up to 1100 ml. 



*Termamyl is a commercial 1 i cheni f onnis a-amylase avalable 
from Novo Nordisk A/S. 



15 Glucose solution 

Glucose. H 2 0 600 g 

Citric acid 0,6 g 

Water added up to 1000 ml 

20 

The fermentation was stopped after 166 hours of fermentation at 
an OD 650 value of 122. At that time there were 12 00 ml of 
fermentation broth in the fermentor. 

25 The fermentation broth was centrifuged, and the extracellular 
volume was 400 ml containing 40 CMC-endoase units per ml (16.000 
CMC-endoase units in all) . The culture medium was further 
processed by filtration and dilution followed by concentration 
on an Amicon ultrafiltration module with a cut-off at 10.000 MW. 

30 The concentrated enzyme solution was frozen. 

Half of the frozen liquid was thawed and diluted with deionized 
water and then concentrated once more on an Amicon 
ultrafiltration module. The total yield was 4471 CMC-endoase 
35 units (from 8000 CMC-endoase units) . 
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The total volume of 1050 ml was subjected to ion exchange 
chromatography at pH 7 . The enzyme was bound to a DEAE- 
Sephacryl anion exchange column (3 00 ml volume) at pH 7 (50 mM 
5 tris-HCl) . The Endol enzyme was eluted at pH 7 with 0.3 M NaCl. 

The purified enzyme has a molecular weight of 58,000 D on SDS- 
PAGE. The pi is 4.0. Its activity is 3 0 CMC-endoase units per mg 
protein. 

10 

The protein determination is based on the amino acid composition 
of the enzyme deduced from the DNA sequence: 13 tryptophan/ 3 0 
tyrosine and a molecular weight of 57 , 566 D. The extinction, 
coefficient is calculated by means of the following formula: 

15 

( 13 X 5559 + 30 X 1197 ) / 57566 = 1.88 

The purified enzyme has an endoglucanase activity of 57 CMC- 
endoase units- per ml and an absorbance at 280nm of 3.6. 
20 Thus, (57 x 1.88)/3.6 = 30 CMC-endoase units per mg protein. 

B. Stability of Endol in detergents 

The following 4 detergent compositions were used: 

25 

1. USA liquid detergent: 2 gram per liter of 6° hardness water 
( 1 part tap water to 2 parts deionized water) . The pH was 
measured to 7,29. 

30 2. USA Heavy Duty Powder detergent: 0,9 gram per liter of 6° 
hardness water. The pH was measured to 9,2. 

3. Heavy Duty Powder detergent (2) with bleach and activator: 
0.12 gram/1 Na-perborate tetrahydrate and 0,088 gram/1 NOBS. The 
35 pH was measured to 9,2. 
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10 



20 



25 



30 



4. European Heavy Duty Powder detergent with bleach and 
activator (Batch DR 8806 Europe). 5 gram per liter in 9° 
hardness water . 

Celluzyme™ (batch CAX 007 crude enzyme with cellulase and other 
enzymes) 23 53 CMC-endoase units per gram was compared with Endol 
3 0000 CMC-endoase units per gram. 



The enzymes were diluted to 3 CMC-endoase units per ml in all 4 
detergent solutions: The endoglucanase activity after dilution 
was measured as described above (by determining the decrease in 
the viscosity of CMC) . The endoglucanase activity after 60 min. 
incubation at 4 0 *C was measured and compared with the initial 
15 activity. The following results were obtained: 

Detergent solution 12 3 4 

Celluzyme™ . 89% 75% 66% 75% 



Endo1 106% 90% 97% 98% 

The standard deviation is 10%. 

It appears from the table that Endol is more stable at a pH of 
9-10 compared with Celluzyme™ in these detergents. 

EXAMPTT? A 

A. Preparation of full-le n ath f~75 kDl Endol in B. subtilis 



10 1 of ub medium containing l mM CuCl 2 and 10 Mg/ml 
chloramphenicol was inoculated with 10 ml of an overnight 
35 culture of subtilis DN969 (B. Diderichsen et al . , 
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J" . Bacterid . 172 (8). 1990, pp. 4315-4321) containing the plasmid 
pCH7 (described above in example 1) , divided among 10 sterile 2 
1 flasks and incubated with vigorous shaking for 3 6 hours at 
37 °C. The culture was centrifuged for 10 minutes at lOOOOxg and 
5 4°C after which EDTA, pH 8, was added to the supernatant to a 
, final concentration of 5 mM. 

25 g of Avicel PH-105 which had been hydrated in ethanol and 
washed with destilled water was added to the supernatant which 
10 was left standing with gentle stirring for 2 hours at 4°C. The 
supernatant/Avicel mixture was centrifuged for 10 minutes at 
lOOOOxg and 4°C. The cleared supernatant was decanted off 
immediately after the rotor had stopped. 

15 The Avicel/enzyme cake was resuspended and washed in 2 00 ml of 
ImM EDTA, and the mixture was centrifuged for 1 minute at 
lOOOOxg, This procedure was repeated twice. The Avicel/enzyme 
cake was then resuspended in 150 ml (1% triethyl amine and 1 mM 
EDTA) and was -left standing with vigorous stirring for 1 hour at 

20 4°C. The mixture was centrifuged for 1 minute at lOOOOxg and 
4°c. The supernatant was retained. This procedure was repeated 
twice. 

The solution of enzyme and triethylamine (about 300 ml) was was 
25 evaporated in vacuo to 100 ml. The temperature of the solution 
was not allowed to exceed 10 *C. The pH was adjusted to 7 by 
adding 1M HC1, and the solution was frozen at -70 °C. 

The amount of protein in the 100 ml enzyme solution was 
* 3 0 determined to be 40 mg by means of a Bradford reagent (available 
from BioRad) using bovine serum albumin as the standard. 

B . Characterisation of the -75 kD Endol 

35 The enzyme obtained above had a purity of about 90%. The enzyme 
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was found to have a molecular weight of 75 kD on SDS-PAGE. 

In iirmiunoprecipitation experiments (carried out by rocket 
Immunoelectrophoresis in agarose gel as described by N. Axelsen 
et al., Chapter 2 in A Manual of ouantit a tivp 
Immunoelectrophoresis . Blackwell Scientific Publ. 1973), -75 kD 
Endo1 was inununoreactive with a monospecific rabbit antibody 
raised against the core region (-58 kD form) of Endol. 

The -75 kD and -58 kD (processed) forms of Endol were tested for 
their ability to bind to cellulose (Avicel) . io fig of each of 
-75 kD Endol and -58 kD Endol were added to 50 fil of a cellulose 
suspension (0.1% w/w Avicel, 5 mM EDTA, pH 8.0) . The suspensions, 
were shaken for 3 0 minutes, and the Avicel was harvested at 
10000 rpm for 2 minutes. The amount of Endol cellulase remaining 
in the supernatant was analysed by SDS-PAGE. More than 95% of 
the -75 kD Endol was bound to the cellulose, while less than 5% 
of the -58 kD Endol was similarly bound. This shows that the c- 
terminal part of the -75 kD Endol cellulase comprises a 
cellulose-binding domain. 

Extensive amino acid sequence homology was found between this 
region (from amino acid 554 to 700) of the Endol cellulase and 
other cellulases, e.g. an endocellulase from Bacillus subtil is 
(Nakamura et al., 1987), the middle part of the bifunctional 
cellulase from Caldocellum sacbharolvtieum (D.J. Saul et al., 
Nucl. Acids Res. 17, 1988, p. 439), two endocellulases from 
Clostridium stercorarium (W. Schwarz et al. , Biotech . T^-n- . 11, 
1989, pp. 461-466) . 

C. Colour clari fication effect of -75 kp Endol 

The colour clarification effect of -75 kD Endol was determined 
by exposing a prewashed worn textile surface to the enzyme and 
then measuring the clarity of the surface colour compared to the 
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clarity of the surface colour of textiles which had not been 
treated with the enzyme. 

Black 100% cotton swatches (15 x 10 cm) were prewashed and 
5 tumble-dried under the following conditions 



10 



Detergent : 



Temperature : 
Washing time: 
Drying time: 
No* prewashing/ 
drying treatments : 15 



Keminus (available from Irma A/S) , 
1.5g/l 
70"C 

60 minutes 
3 0 minutes 



15 



20 



The swatches were prewashed in a conventional washing machine 
(Miele Deluxe Electronic W761) . After each wash f the swatches 
were dried in a tumble-drier. The visual effect of the 
prewashing/drying was that the surface colour turned greyish due 
to the presence of damaged cellulose fibres causing the worn 
look. 



After prewashing, the swatches were washed in a Terg-O-Tometer 
(toploaded mini washing machine) under the following conditions 



25 



30 



Liquid volume: 
Agitation: 
Detergent: 
Washing time: 
Washing tempera- 
ture: 

No. of swatches: 
-75 kD Endol 
dosage : 
pH: 



800 ml 

100 movements/minute 
Standard detergent, 5g/l 
30 minutes 

40°C 
2 

0 and 60 CMC endoase units/1 
7.0 



35 



No . o f treatments : 3 
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Standard detergent: 

IAS NANSA 1169/P: 10% 
AE Berol 160: 15% 
Ethanol, 96%: io% 
TEA: 5% 
Water: 60% 

After each wash, the swatches were rinsed in tap water and dried 
at room temperature . 

The surface colour of the swatches was analysed by measuring 
reflected light. White light was projected onto the surface, and 
the reflection/remission was measured at 16 wavelengths (400 nm 
- 7 00 nm) . The results from the measurements were processed (by 
means of an "Elrepho 2000" apparatus available from Datacolor, 
Switzerland) into Hunter coordinates of which the L-coordinate 
represents the. grey scale values. Each swatch was analysed twice 
on each side, and the results shown below are are a total 
average from the measurements of the two swatches from the same 
treatment, in the table, white is L = 100, and black is l = o. 

Dosage 0 CMC endoase/160 CMC endoase/1 

L 16.78 15.50 

S,D * 0.08 0.03 

Delta L _ 1b28 

Comparable results were obtained with Celluzyme™ (batch PPC 2174 
containing a mixture of enzymes from Humicola in s mor, e DSM 
1800) 

Dosage 

(CMC endoase/1) o 15 30 60 

Delta L ~ 1.02 1.48 1.90 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Novo Nordisk A/S 
(ii) TITLE OF INVENTION: An Enzyme Exhibiting Cellulase Activity 
10 (iii) NUMBER OF SEQUENCES: 7 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Novo Nordisk A/S, Patent Department 

(B) STREET: Novo Alle 
15 (C) CITY: Bagsvaerd 

(E) COUNTRY: DENMARK 

(F) ZIP: DK-2880 

(v) COMPUTER READABLE FORM: 
20 (A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: 

25 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

30 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DK 164/90 

(B) FILING DATE: 19-JAN-1990 

(viii) ATTORNEY/AGENT INFORMATION: 
35 (A) NAME: Thalse-Madsen, Kine Birgit 

(B) REGISTRATION NUMBER: 

(C) REFERENCE/DOCKET NUMBER: 3425.204-W0 

(ix) TELECOMMUNICATION INFORMATION: 
40 (A) TELEPHONE: +45 4444 8888 

(B) TELEFAX: +45 4449 3256 

(C) TELEX: 37304 



45 (2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2977 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus lautus 
5 (B) STRAIN: NCIMB 40250 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 677.. 2776 
10 (D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



15 


ATCTTGAAGC 


TTCGAGGTTC 


GCGTAACCAG 


GTCGATCGGC 


TGACTTTCCT 


CGCCAAGCTG 


60 




CACCATGGCT 


TCATAGATTA 


ATTGGTGAGG 


AGGATCATAG 


AAATCTTCCG 


TCCGCACCGC 


120 


20 


TCCATGGCGG 


TAATCAGCGC 


TTCCGACTGC 


AGTACAGAGC 


GGCCGATACA 


AATGCAGCCG 


180 


ACAACCAGAT 


CAAGCCGTCC 


TTCAACATCA 


AAAACAACGG 


TACTTCGGCT 


GTTGATTTAA 


240 




GCACGCTCAA 


AATCCGCTAC 


TACTTCACCA 


AGGATGGTTC 


TGCGGCGGTG 


AACGGCTGGA 


300 


25 


TCGACTGGGC 


GCAGCTCGGC 


GGCAGCAACA 


TTCAGATCTC 


GTTTGGCAAC 


CATACTGGCA 


360 




CGAATTCGGA 


TACGT.ACGTG 


GAGCTGAGCT 


TCTCGTCCGA 


GGCAGGCTCG 


ATTGCGGCGG 


420 


30 


GCGGCCAATC 


CGGTGAAATC 


CAGCTGCGCA 


TGTCCAAGAC 


GGACTGGTCG 


AACTTTAACG 


480 


AGGCGAACGA 


CTACTCGTTC 


GATGGGACGA 


AGACGGCCTT 


TGCTGACTGG 


GATCGGGTCG 


540 




TATTGTACCA 


GAACGGCCAA 


ATAGTGTGGG 


GAACTGCTCC 


ATAAACCGAT ACAGGGGAAT 


600 


35 


GTGCCGGAAC 


CGCTCTTTTG 


CAGGGCAGAC 


TGGCGGTATC 


CCTTGCTGAA ATGACTATTC 


660 




CTGGGAGGGA 


TCAAAA ATG 
Met 


AAG ACA AGA CAA AGA AAG CGG CTG TTC GTC 
Lys Thr Arg Gin Arg Lys Arg Leu Phe Val 


709 



1 5 10 

40 

AGT GCG GCG CTG GCA GTA TCC TTG ACA ATG ACC GTA CCG ATG CCC GCT 757 
Ser Ala Ala Leu Ala Val Ser Leu Thr Met Thr Val Pro Met Pro Ala 
15 20 25 

45 TCT GTA AAT GCA GCT GCG AGT GAT GTC ACT TTC ACG ATT AAT ACG CAG 805 
Ser Val Asn Ala Ala Ala Ser Asp Val Thr Phe Thr He Asn Thr Gin 
30 35 40 

TCG GAA CGT GCA GCG ATC AGC CCC AAT ATT TAC GGA ACC AAT CAG GAT 853 
50 Ser Glu Arg Ala Ala lie Ser Pro Asn He Tyr Gly Thr Asn Gin Asp 
45 50 55 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



CTG AGC GGG ACG GAG AAC TGG TCA TCC CGC AGG CTC GGA GGC AAC CGG 
Leu Ser Gly Thr Glu Asn Trp Ser Ser Arg Arg Leu Gly Gly Asn Arg 
60 65 70 75 

CTG ACG GGT TAC AAC TGG GAG AAC AAC GCA- TCC AGC GCC GGA AGG GAC 
Leu Thr Gly Tyr Asn Trp Glu Asn Asn Ala Ser Ser Ala Gly Arg Asp 
80 85 go 

TGG CTT CAT TAC AGC GAT GAT TTT CTC TGC GGC AAC GGT GGT GTT CCA 
Trp Leu His Tyr Ser Asp Asp Phe Leu Cys Gly Asn Gly Gly Val Pro 
95 100 105 

GAC ACC GAC TGC GAC AAG CCG GGG GCG GTT GTT ACC GCT TTT CAC GAT 
Asp Thr Asp Cys Asp Lys Pro Gly Ala Val Val Thr Ala Phe His Asp 
HO H5 120 

AAA TCT TTG GAG AAT GGA GCT TAC TCC ATT GTA ACG CTG CAA ATG GCG 
Lys Ser Leu Glu Asn Gly Ala Tyr Ser He Val Thr Leu Gin Met Ala 
125 130 135 

GGT TAT GTG TCC CGG GAT AAG AAC GGT CCA GTT GAC GAG AGT GAG ACG 
Gly Tyr Val Ser Arg Asp Lys Asn Gly Pro Val Asp Glu Ser Glu Thr 
140 145 150 i 5 5 

GCT CCG TCA CCG CGT TGG GAT AAG GTC GAG TTT GCC AAA AAT GCG CCG 
Ala Pro Ser Pro Arg Trp Asp Lys Val Glu Phe Ala Lys Asn Ala Pro 
160 165 170 

TTC TCC CTT CAG CCT GAT CTG AAC GAC GGA CAA GTG TAT ATG GAT GAA 
Phe Ser Leu Gin Pro Asp Leu Asn Asp Gly Gin Val Tyr Met Asp Glu 
175 180 185 

GAA GTT AAC TTC CTG GTC AAC CGG TAT GGA AAC GCT TCA ACG TCA ACG 
Glu Val Asn Phe Leu Val Asn Arg Tyr Gly Asn Ala Ser Thr Ser Thr 
190 195 200 

GGC ATC AAA GCG TAT TCG CTG GAT AAC GAG CCG GCG CTG TGG TCT GAG 
Gly lie Lys Ala Tyr Ser Leu Asp Asn Glu Pro Ala Leu Trp Ser Glu 
205 210 215 

ACG CAT CCA AGG ATT CAT CCG GAG CAG TTA CAA GCG GCA GAA CTC GTC 
Thr His Pro Arg He His Pro Glu Gin Leu Gin Ala Ala Glu Leu Val 
220 225 230 235 

GCT AAG AGC ATC GAC TTG TCA AAG GCG GTG AAG AAC GTC GAT CCG CAT 
Ala Lys Ser He Asp Leu Ser Lys Ala Val Lys Asn Val Asp Pro His 
240 245 250 

GCC GAA ATA TTC GGT CCT GCC CTT TAC GGT TTC GGC GCA TAT TTG TCT 
Ala Glu He Phe Gly Pro Ala Leu Tyr Gly Phe Gly Ala Tyr Leu Ser 
255 260 265 



901 



949 



997 



1045 



1093 



1141 



1189 



1237 



1285 



1333 



1381 



1429 



1477 
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CTG CAG GAC GCA CCG GAT TGG CCG AGT TTG CAA GGC AAC TAC AGC TGG 1525 
Leu Gin Asp Ala Pro Asp Trp Pro Ser Leu Gin Gly Asn Tyr Ser Trp 
270 275 280 

5 TTT ATC GAT TAC TAT CTG GAT CAG ATG AAG AAT GCT CAT ACG CAG AAC 1573 
Phe lie Asp Tyr Tyr Leu Asp Gin Met Lys Asn Ala His Thr Gin Asn 
285 290 295 

GGC AAA AGA TTG CTC GAT GTG CTG GAC GTC CAC TGG TAT CCG GAA GCA 1621 
10 Gly Lys Arg Leu Leu Asp Val Leu Asp Val His Trp Tyr Pro Glu Ala 
300 305 310 315 

CAG GGC GGA GGC CAG CGA ATC GTC TTT GGC GGG GCG GGC AAT ATC GAT 1569 
Gin Gly Gly Gly Gin Arg lie Val Phe Gly Gly Ala Gly Asn He Asp 
15 320 325 330 

ACG CAG AAG GCT CGC GTA CAA GCG CCA AGA TCG CTA TGG GAT CCG GCT 1717 

Thr Gin Lys Ala Arg Val Gin Ala Pro Arg Ser Leu Trp Asp Pro Ala 

335 340 345 

20 

TAC CAG GAA GAC AGC TGG ATC GGC ACA TGG TTT TCA AGC TAC TTG CCC 1765 

Tyr Gin Glu Asp Ser Trp He Gly Thr Trp Phe Ser Ser Tyr Leu Pro 

350 355 360 

25 TTA ATT CCG AAG CTG CAA TCT TCG ATT CAG ACG TAT TAT CCG GGT ACG 1813 
Leu lie Pro Lys Leu Gin Ser Ser lie Gin Thr Tyr Tyr Pro Gly Thr 
365 . 370 375 

AAG CTG GCG ATC ACA GAG TTC AGC TAC GGC GGA GAC AAT CAC ATT TCG 1861 
30 Lys Leu Ala He Thr Glu Phe Ser Tyr Gly Gly Asp Asn His He Ser 
380 385 390 395 

GGA GGC ATA GCT ACC GCG GAC GCG CTC GGC ATT TTT GGA AAA TAT GGC 1909 
Gly Gly He Ala Thr Ala Asp Ala Leu Gly lie Phe Gly Lys Tyr Gly 
35 400 405 410 

GTT TAT GCC GCG AAT TAC TGG CAG ACG GAG GAC AAT ACC GAT TAT ACC 1957 
Val Tyr Ala Ala Asn Tyr Trp Gin Thr Glu Asp Asn Thr Asp Tyr Thr 
415 420 425 

40 

AGC GCT GCT TAC AAG CTG TAT CGC AAC TAC GAC GGC AAT AAA TCG GGG 2005 
Ser Ala Ala Tyr Lys Leu Tyr Arg Asn Tyr Asp Gly Asn Lys Ser Gly 
430 435 440 

45 TTC GGC TCG ATC AAA GTG GAC GCC GCT ACG TCC GAT ACG GAG AAC AGC 2053 
Phe Gly Ser He Lys Val Asp Ala Ala Thr Ser Asp Thr Glu Asn Ser 
445 450 . 455 

TCG GTA TAC GCT TCG GTA ACT GAC GAG GAG AAT TCC GAA CTC CAC CTG 2101 
50 Ser Val Tyr Ala Ser Val Thr Asp Glu Glu Asn Ser Glu Leu His Leu 
460 465 470 475 
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10 



20 



25 



30 



40 



45 



50 



ATC GTG CTG AAT AAA AAT TTC GAC GAT CCG ATC AAC GCT ACT TTC CAG 
lie Val Leu Asn Lys Asn Phe Asp Asp Pro lie Asn Ala Thr Phe Gin 
480 485 490 

CTG TCT GGT GAT AAA ACC TAC ACA TCC GGG AGA GTA TGG GGC TTC GAC 
Leu Ser Gly Asp Lys Thr Tyr Thr Ser Gly Arg Val Trp Gly Phe Asp 
495 500 505 

CAA ACC GGA TCC GAC ATT ACG GAA CAA GCA GCT ATA ACG AAT ATT AAC 
Gin Thr Gly Ser Asp He Thr Glu Gin Ala Ala He Thr Asn He Asn 
510 515 520 



AAC AAT CAA TTC" ACG TAT ACG CTT CCT CCA TTG TCG GCT TAC CAC ATT 
Asn Asn Gin Phe Thr Tyr Thr Leu Pro Pro Leu Ser Ala Tyr His He 
15 525 530 535 

GTT CTG AAA GCG GAT AGC ACC GAA CCG GTC AAC TCC GAT CTC GTC GTG 
Val Leu Lys Ala Asp Ser Thr Glu Pro Val Asn Ser Asp Leu Val Val 
540 545 550 555 



CAG TAT AAG GAC GGT GAT CGC AAC AAT GCA ACC GAC AAT CAG ATC AAG 
Gin Tyr Lys Asp Gly Asp Arg Asn Asn Ala Thr Asp Asn Gin He Lys 
550 565 570 

CCG CAT TTC AAT ATT CAA AAT AAA GGG ACC AGC CCG GTA GAT CTG AGT 
Pro His Phe Asn lie Gin Asn Lys Gly Thr Ser Pro Val Asp Leu Ser 
575 . 580 585 

TCC CTT ACC CTG CGC TAC TAT TTT ACC AAA GAC AGC TCT GCA GCG ATG 
Ser Leu Thr Leu Arg Tyr Tyr Phe Thr Lys Asp Ser Ser Ala Ala Met 
590 595 600 



AAC GGC TGG ATC GAT TGG GCG AAG CTC GGC GGC AGC AAC ATT CAG ATT 
Asn Gly Trp lie Asp Trp Ala Lys Leu Gly Gly Ser Asn He Gin He 
35 605 610 615 

TCG TTC GGT AAT CAT AAT GGC GCG GAT TCG GAT ACG TAC GCG GAG CTG 
Ser Phe Gly Asn His Asn Gly Ala Asp Ser Asp Thr Tyr Ala Glu Leu 
620 625 630 635 



GGC TTC TCG TCC GGC GCA GGC TCG ATT GCG GAG GGC GGT CAA TCC GGC 
Gly Phe Ser Ser Gly Ala Gly Ser He Ala Glu Gly Gly Gin Ser Gly 
640 645 650 

GAA ATC CAG CTG CGC ATG TCG AAG GCG GAC TGG TCG AAC TTC AAC GAG 
Glu He Gin Leu Arg Met Ser Lys Ala Asp Trp Ser Asn Phe Asn Glu 
655 660 665 

GCG AAC GAC TAC TCG TTC GAT GGG GCG AAG ACG GCC TAT ATA GAT TGG 
Ala Asn Asp Tyr Ser Phe Asp Gly Ala Lys Thr Ala Tyr He Asp Trp 
670 675 680 



2149 



2197 



2245 



2293 



2341 



2389 



2437 



2485 



2533 



2581 



2629 



2677 



2725 



10 



15 
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GAT CGC GTG ACG CTA TAC CAA GAC GGA CAA CTC GTA TGG GGA ATC GAG 2773 
Asp Arg Val Thr Leu Tyr Gin Asp Gly Gin Leu Val Trp Gly He Glu 
685 690 695 

CCG TAGAAGATGA CTAGACAACA TTAGTGATGA GACGCGGCCG GCCATAACGG 2826 

Pro 

700 

CTGTCTTGAC TCTGATTCGA TCAAAAAATC AAAGCAAAGG GGATGAAAGT AATGAATGTT 2886 

GCGATTCAAA AGAGAATCGG ATCAATATTG ATGATTGCCT CACTAATTAT TAGCTTATTG 2946 

CCGTTAGGGA GCAGCAGAAC GTATGCTGCA G 2977 

(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Lys Thr Arg Gin Arg Lys Arg Leu Phe Val Ser Ala Ala Leu Ala 
15 10 15 

3 0 Val Ser Leu Thr Met Thr Val Pro Met Pro Ala Ser Val Asn Ala Ala 

20 25 30 



35 



50 



Ala Ser Asp Val Thr Phe Thr He Asn Thr Gin Ser Glu Arg Ala Ala 
35 40 45 

He Ser Pro Asn He Tyr Gly Thr Asn Gin Asp Leu Ser Gly Thr Glu 
50 55 60 



Asn Trp Ser Ser Arg Arg Leu Gly Gly Asn Arg Leu Thr Gly Tyr Asn 
40 65 70 75 80 

Trp Glu Asn Asn Ala Ser Ser Ala Gly Arg Asp Trp Leu His Tyr Ser 
85 90 95 

45 Asp Asp Phe Leu Cys Gly Asn Gly Gly Val Pro Asp Thr Asp Cys Asp 
100 105 110 



Lys Pro Gly Ala Val Val Thr Ala Phe His Asp Lys Ser Leu Glu Asn 
115 120 125 

Gly Ala Tyr Ser He Val Thr Leu Gin Met Ala Gly Tyr Val Ser Arg 

130 135 140 
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Asp Lys Asn Gly Pro Val Asp Glu Ser Glu Thr Ala Pro Ser Pro Arg 

145 150 155 160 

5 Trp Asp Lys Val Glu Phe Ala Lys Asn Ala Pro Phe Ser Leu Gin Pro 

165 170 175 



10 



Asp. Leu Asn Asp Gly Gin Val Tyr Met Asp Glu Glu Val Asn Phe Leu 
180 185 190 

Val Asn Arg Tyr Gly Asn Ala Ser Thr Ser Thr Gly He Lys Ala Tyr 
195 200 205 



15 



Ser Leu Asp Asn Glu Pro Ala Leu Trp Ser Glu Thr His Pro Arg He 
210 215 220 



His Pro Glu Gin Leu Gin Ala Ala Glu Leu Val Ala Lys Ser He Asp 
225 230 235 240 

20 Leu Ser Lys Ala Val Lys Asn Val Asp Pro His Ala Glu He Phe Gly 

245 250 255 



25 



30 



Pro Ala Leu Tyr Gly Phe Gly Ala Tyr Leu Ser Leu Gin Asp Ala Pro 
260 265 270 

Asp Trp Pro Ser Leu Gin Gly Asn Tyr Ser Trp Phe He Asp Tyr Tyr 
275 . 280 285 

Leu Asp Gin Met Lys Asn Ala His Thr Gin Asn Gly Lys Arg Leu Leu 
290 295 300 



Asp Val Leu Asp Val His Trp Tyr Pro Glu Ala Gin Gly Gly Gly Gin 
305 310 315 320 

35 Arg He Val Phe Gly Gly Ala Gly Asn He Asp Thr Gin Lys Ala Arg 

325 330 335 



Val Gin Ala Pro Arg Ser Leu Trp Asp Pro Ala Tyr Gin Glu Asp Ser 
340 345 350 

Trp He Gly Thr Trp Phe Ser Ser Tyr Leu Pro Leu He Pro Lys Leu 
355 360 365 

Gin Ser Ser He Gin Thr Tyr Tyr Pro Gly Thr Lys Leu Ala He Thr 
370 375 380 

Glu Phe Ser Tyr Gly Gly Asp Asn His He Ser Gly Gly He Ala Thr 
385 390 395 400 

50 Ala Asp Ala Leu Gly He Phe Gly Lys Tyr Gly Val Tyr Ala Ala Asn 

405 410 415 



40 



45 
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Tyr Trp Gin Thr Glu Asp Asn Thr Asp Tyr Thr Ser Ala Ala Tyr Lys 
420 425 430 

Leu Tyr Arg Asn Tyr Asp Gly Asn Lys Ser Gly Phe Gly Ser He Lys 
5 435 440 445 

Val Asp Ala Ala Thr Ser Asp Thr Glu Asn Ser Ser Val Tyr Ala Ser 
* 450 455 460 

io Val Thr Asp Glu Glu Asn Ser Glu Leu His Leu He Val Leu Asn Lys 
465 470 475 480 



15 



30 



45 



Asn Phe Asp Asp Pro He Asn Ala Thr Phe Gin Leu Ser Gly Asp Lys 

485 490 495 

Thr Tyr Thr Ser Gly Arg Val Trp Gly Phe Asp Gin Thr Gly Ser Asp 

500 505 510 



He Thr Glu Gin Ala Ala He Thr Asn He Asn Asn Asn Gin Phe Thr 
20 515 520 525 

Tyr Thr Leu Pro Pro Leu Ser Ala Tyr His He Val Leu Lys Ala Asp 
530 535 540 

25 Ser Thr Glu Pro Val Asn Ser Asp Leu Val Val Gin Tyr Lys Asp Gly 
545 550 555 560 



Asp Arg Asn Asn Ala Thr Asp Asn Gin lie Lys Pro His Phe Asn lie 

565 570 575 

Gin Asn Lys Gly Thr Ser Pro Val Asp Leu Ser Ser Leu Thr Leu Arg 
580 585 590 



Tyr Tyr Phe Thr Lys Asp Ser Ser Ala Ala Met Asn Gly Trp He Asp 
35 595 600 605 

Trp Ala Lys Leu Gly Gly Ser Asn He Gin He Ser Phe Gly Asn His 
610 615 620 

40 Asn Gly Ala Asp Ser Asp Thr Tyr Ala Glu Leu Gly Phe Ser Ser Gly 
625 630 635 640 



Ala Gly Ser He Ala Glu Gly Gly Gin Ser Gly Glu lie Gin Leu Arg 

645 650 655 

Met Ser Lys Ala Asp Trp Ser Asn Phe Asn Glu Ala Asn Asp Tyr Ser 

660 665 670 



Phe Asp Gly Ala Lys Thr Ala Tyr He Asp Trp Asp Arg Val Thr Leu 
50 675 680 685 
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Tyr Gin Asp Gly Gin Leu Val Trp Gly He Glu Pro 
690 695 700 

(2) INFORMATION FOR SEQ ID N0:3: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
10 (D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Bacillus lautus 

(B) STRAIN: NCI MB 40250 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
20 (B) LOCATION: 172.. 1869 

(D) OTHER INFORMATION: 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

GGGGTTTTGT TCATTTCAAG AAAGAGCACG AATAAACATC GCTAATTTAC ACATGATTTA 60 

TTCTCTAATT ATTATGGTGC ATGCCAATTG GAAAATGTAG TAGATTAGTC ATCGTAACCT 120 

30 GCTTTCATGC CAAGATATGT AATTTTTAAA AAAGAGTAAA GGAGAATTCA G ATG AAA 177 

Met Lys 
1 

AAA CGT AGA AGC AGT AAA GTT ATT CTT TCG TTG GCC ATC GTT GTT GCA 225 
35 Lys Arg Arg Ser Ser Lys Val He Leu Ser Leu Ala lie Val Val Ala 
5 10 15 

TTA TTG GCA GCC GTC GAA CCT AAT GCC GCT TTA GCA GCG GCT CCA CCA 273 
Leu Leu Ala Ala Val Glu Pro Asn Ala Ala Leu Ala Ala Ala Pro Pro 
40 20 25 30 

AGT GCC ATG CAG TCC TAT GTT GAA GCG ATG CAG CCT GGC TGG AAC CTT 321 

Ser Ala Met Gin Ser Tyr Val Glu Ala Met Gin Pro Gly Trp Asn Leu 

35 40 45 50 

45 

GGC AAT TCT CTG GAT GCT GTC GGT GCG GAT GAG ACG CTG GCA CGG GGC 369 

Gly Asn Ser Leu Asp Ala Val Gly Ala Asp Glu ThrLeu Ala Arg Gly 
55 60 65 

50 AAT CCG CGG ATC ACG AAA GAG CTC ATT CAG AAC ATC GCT GCG CAA GGC 417 
Asn Pro Arg He Thr Lys Glu Leu He Gin Asn He Ala Ala Gin Gly 
70 75 80 
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TAT 
Tyr 



GCC 
Ala 



10 GTC 
Val 
115 

CAT 
15 His 



20 



25 



40 



45 



CAA 
Gin 



TTC 
Phe 



CGC 
Arg 



30 CTG 
Leu 
195 

GGC 
35 Gly 



CCC 
Pro 



AAC 
Asn 



Phe 



50 AAT 
Asn 
275 



AAG AGC ATA CGG ATT CCT GTT ACC TGG GAT TCC CAT ATC GGC GCG 
Lys Ser He Arg He Pro Val Thr Trp Asp Ser His He Gly Ala 
85 90 g5 

CCA AAT TAT CAA ATT GAA GCT GCG TAC CTC AAT CGA GTG CAG GAG 
Pro Asn Tyr Gin He Glu Ala Ala Tyr Leu Asn Arg Val Gin Glu 
100 105 no 

GTA CAG TGG GCT TTG GAC GCG AAC CTC TAT GTG ATG ATT AAT GTC 
Val Gin Trp Ala Leu Asp Ala Asn Leu Tyr Val Met He Asn Val 
120 125 130 

CAT GAT TCC TGG CTA TGG ATC AGC AAA ATG GAG TCG CAG CAC GAT 
His Asp Ser Trp Leu Trp He Ser Lys Met Glu Ser Gin His Asd 
135 140 145 

GTA CTG GCC CGT TAT AAT GCG ATT TGG ACG CAA ATT GCC AAC AAG 
Val Leu Ala Arg Tyr Asn Ala He Trp Thr Gin lie Ala Asn Lvs 
150 155 160 

AAG AAC AGC CCG AGC AAG CTG ATG TTC GAG AGC GTG AAT GAG CCT 
Lys Asn Ser Pro Ser Lys Leu Met Phe Glu Ser Val Asn Glu Pro 
165 170 175 

TTT ACG GAT GGC GGA ACT ACG GAT GAA GCC AAG CAG CAA AAA ATG 
Phe Thr Asp Gly Gly Thr Thr Asp Glu Ala Lys Gin Gin Lys Met 
180 185 190 

GAC GAG CTG AAC GTA TCC TTT TTC AAC ATC GTC AGA AAT TCC GGC 
Asp Glu Leu Asn Val Ser Phe Phe Asn He Val Arg Asn Ser Gly 
200 205 210 

CAG AAC GCG ACT CGC CCG CTA GTT CTT TCT ACG TTG GAG GCC TCT 
Gin Asn Ala Thr Arg Pro Leu Val Leu Ser Thr Leu Glu Ala Ser 
215 220 225 

ACC CAA GAG AGA ATG ACG GCG CTT TAT AAT ACG ATG ACC AAA CTG 
Thr Gin Glu Arg Met Thr Ala Leu Tyr Asn Thr Met Thr Lys Leu 
230 235 240 

GAC AAG AAT CTG ATC GCA ACC GTT CAT TTT TAT GGA TTC TGG CCG 
Asp Lys Asn Leu He Ala Thr Val His Phe Tyr Gly Phe Trp Pro 
245 250 255 

AGC GTA AAT ATC GCA GGA TAT ACG AAA TTT GAT GCG GAG ACG CAA 
Ser Val Asn He Ala Gly Tyr Thr Lys Phe Asp Ala Glu Thr Gin 
260 265 270 

GAT AJT ATA ACG ACC TTC GAT AAC GTG TAT AAC ACA TTT GTA GCA 
Asp He He Thr Thr Phe Asp Asn Val Tyr Asn Thr Phe Val Ala 
280 285 290 



465 



513 



561 



609 



657 



705 



753 



801 



849 



897 



945 



993 
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AAG GGA ATC CCG GTG GTA GTC GGC GAA TAT GGC CTT CTT GGA TTC GAT 
Lys Gly He Pro Val Val Val Gly Glu Tyr Gly Leu Leu Gly Phe Asp 
295 300 305 



1089 



AAG AAT ACC GGC GTC ATT GAA CAG GGT GAG AAA TTG AAA TTT TTC GAG 
Lys Asn Thr Gly Val He Glu Gin Gly Glu Lys Leu Lys Phe Phe Glu 
310 315 320 



1137 



10 TTT TTT GCC CAG TAT GTG AAG CAA AAA AGC ATT TCC ACT ATG CTA TGG 
Phe Phe Ala Gin Tyr Val Lys Gin Lys Ser lie Ser Thr Met Leu Trp 
325 330 335 



1185 



GAT AAC GGA CAG CAC TTC AAC CGC ACG AGC TTC AAG TGG TCT GAC CCG 
15 Asp Asn Gly Gin His Phe Asn Arg Thr Ser Phe Lys Trp Ser Asp Pro 
340 345 350 



1233 



20 



25 



GAT TTA TTC AAT ATG ATC AAG GCC AGT TGG ACC GGA CGT TCA TCC ACG 

Asp Leu Phe Asn Met He Lys Ala Ser Trp Thr Gly Arg Ser Ser Thr 

355 360 365 370 

GCT TCC AGC GAC CTG ATC CAT GTC AAG CAG GGC ACG GCG GTA AAA GAT 

Ala Ser Ser Asp Leu lie His Val Lys Gin Gly Thr Ala Val Lys Asp 

375 380 385 

ACT TCG GTT CAG CTC AAT CTT AAC GGG AAT ACG CTA ACT TCC CTT TCC 

Thr Ser Val Gin leu Asn Leu Asn Gly Asn Thr Leu Thr Ser Leu Ser 

390 395 400 

30 GTA AAT GGA ACG ACA CTG AAA TCA GGC ACA GAT TAC ACT TTA AAC AGC 

Val Asn Gly Thr Thr Leu Lys Ser Gly Thr Asp Tyr Thr Leu Asn Ser 

405 410 415 

AGC AGA TTA ACT TTT AAA GCG AGC CAG TTG ACC AAG CTG ACC TCC TTG 

35 Ser Arg Leu Thr Phe Lys Ala Ser Gin Leu Thr Lys Leu Thr Ser Leu 

420 425 430 



GGC AAA TTG GGG GTC AAC GCG ACG ATC GTG ACT AAA TTC AAT AGA GGC 

Gly Lys Leu Gly Val Asn Ala Thr He Val Thr Lys Phe Asn Arg Gly 

435 440 445 450 

GCC GAC TGG AAG TTC AAC GTA GTC CTG TAC AAT ACG CCT AAG CTT AGC 

Ala Asp Trp Lys Phe Asn Val Val Leu Tyr Asn Thr Pro Lys Leu Ser 

455 460 465 

AGT ACG ACG GGG ACT ACT TCT TCC TTT GCG ATT CCA ACG GCT TTC AAC 

Ser Thr Thr Gly Thr Thr Ser Ser Phe Ala lie Pro Thr Ala Phe Asn 

470 475 480 

50 GGG GAT CAG CTT GCT ACG ATG GAA GCG GTC TAT GTA AAC GGC GGC AAT 

Gly Asp Gin Leu Ala Thr Met Glu Ala Val Tyr Val Asn Gly Gly Asn 

485 490 495 



40 



45 



1281 



1329 
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1425 
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GCC GGT CCG CAT AAC TGG ACT TCC TTT AAG GAA TTC GAA ACG ACG TTC 1713 

Ala Gly Pro His Asn Trp Thr Ser Phe Lys Glu Phe Glu Thr Thr Phe 

500 505 510 

5 

AGC CCC GCT TAT AGC GAG GGG AAA ATC AAA CTG CAG CAG GCG TTC TTT 1761 

Ser Pro Ala Tyr Ser Glu Gly Lys He Lys Leu Gin Gin Ala Phe Phe 
515 520 525 530 

10 AAT GAA GTG AAT GAT ACC ACA GTC ACG CTC AAG TTC CAA TTC TGG AGC 1809 
Asn Glu Val Asn Asp Thr Thr Val Thr Leu Lys Phe Gin Phe Trp Ser 
535 540 545 

GGG GAG ATC GTC AAC TAC ACG ATT AAA AAG AGC GGT TCG ACG GTG ACG 1857 
15 Gly Glu He Val Asn Tyr Thr He Lys Lys Ser Gly Ser Thr Val Thr 
550 555 560 

GGT ACG GCT TCA TAAGCGAGTT TGGCAAAAAA GGACCGATAT ACTGCCTAAT 1909 
Gly Thr Ala Ser 
20 565 



25 



30 



35 



45 



50 



TTGGTATTGC 


CTTAGTTGAA 


AGCAATTGCT 


CCGAATAAAC 


AGAATGAAGC CCCGGCCAGC 


1969 


TGGCCGGGAC 


TTATGCGTTT 


AGGAAGTATA 


AACGAATCAT 


CAGCAATTTA TTTAGCTCGT 


2029 


CTCAGTTCAG 


CAATATCGGC 


TTCATGTGAA 


ACGGAGCGGA 


TGAACAATCT TTCGAGCAAT 


2089 


TTCTCATGCT 


CCTGCTGGGT 


TTGGAGAACG 


GTTTGCTGAT 


TAGTTTTAAG TACAGATATA 


2149 


TCCTCACGGA 


CTTGATTGAT 


TCATGTGGTC 


CGTTAGTTCT 


TCTACCTTTG TATTTGTGGC 


2209 


AGCAACGATA 


TGAATTAATT 


GTTGAATGTG 


CCCGCCATGA 


CTGTTTAGCT GCTCATTGTG 


2269 


GCTTTGTAAC 


TGTTCTCGGA 


TTTCTTTGAA 


TTCTTGGTCG 


TGCTCATTAA GCTT 


2323 



(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 566 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Lys Lys Arg Arg Ser Ser Lys Val He Leu Ser Leu Ala He Val 
15 10 15 

Val Ala Leu Leu Ala Ala Val Glu Pro Asn Ala Ala Leu Ala Ala Ala 
20 25 30 
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25 



40 



70 



Pro Pro Ser Ala Met Gin Ser Tyr Val Glu Ala Met Gin Pro Gly Trp 

35 40 45 

5 Asn Leu Gly Asn Ser Leu Asp Ala Val Gly Ala Asp Glu Thr Leu Ala 

50 55 60 



Arg Gly Asn Pro Arg He Thr Lys Glu Leu He Gin Asn lie Ala Ala 
65 70 75 80 

Gin Gly Tyr Lys Ser lie Arg He Pro Val Thr Trp Asp Ser His lie 
85 90 95 



Gly Ala Ala Pro Asn Tyr Gin He Glu Ala Ala Tyr Leu Asn Arg Val 
15 100 105 110 

Gin Glu Val Val Gin Trp Ala Leu Asp Ala Asn Leu Tyr Val Met lie 
115 120 125 

20 Asn Val His His Asp Ser Trp Leu Trp He Ser Lys Met Glu Ser Gin 
130 135 140 



His Asp Gin Val Leu Ala Arg Tyr Asn Ala He Trp Thr Gin He Ala 
145 150 155 160 

Asn Lys Phe Lys Asn Ser Pro Ser Lys Leu Met Phe Glu Ser Val Asn 
J65 170 175 



Glu Pro Arg Phe Thr Asp Gly Gly Thr Thr Asp Glu Ala Lys Gin Gin 
30 180 185 190 

Lys Met Leu Asp Glu Leu Asn Val Ser Phe Phe Asn He Val Arg Asn 

195 200 205 

35 Ser Gly Gly Gin Asn Ala Thr Arg Pro Leu Val Leu Ser Thr Leu Glu 
210 215 220 



Ala Ser Pro Thr Gin Glu Arg Met Thr Ala Leu Tyr Asn Thr Met Thr 
225 230 235 240 

Lys Leu Asn Asp Lys Asn Leu He Ala Thr Val His Phe Tyr Gly Phe 
245 250 255 



Trp Pro Phe Ser Val Asn He Ala Gly Tyr Thr Lys Phe Asp Ala Glu 
45 260 265 270 

Thr Gin Asn Asp He He Thr Thr Phe Asp Asn Val Tyr Asn Thr Phe 
275 280 285 

50 Val Ala Lys Gly He Pro Val Val Val Gly Glu Tyr Gly Leu Leu Gly 
290 295 300 
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Phe Asp Lys Asn Thr Gly Val He Glu Gin Gly Glu Lys Leu Lys Phe 
305 310 315 320 

Phe Glu Phe Phe Ala Gin Tyr Val Lys Gin Lys Ser He Ser Thr Met 
5 325 330 335 

Leu Trp Asp Asn Gly Gin His Phe Asn Arg Thr Ser Phe Lys Trp Ser 
340 345 350 

10 Asp Pro Asp Leu Phe Asn Met He Lys Ala Ser Trp Thr Gly Arg Ser 
355 360 365 



15 



30 



45 



Ser Thr Ala Ser Ser Asp Leu He His Val Lys Gin Gly Thr Ala Val 
370 375 380 

Lys Asp Thr Ser Val Gin Leu Asn Leu Asn Gly Asn Thr Leu Thr Ser 
385 390 395 400 



Leu Ser Val Asn Gly Thr Thr Leu Lys Ser Gly Thr Asp Tyr Thr Leu 
20 405 410 415 

Asn Ser Ser Arg Leu Thr Phe Lys Ala Ser Gin Leu Thr Lys Leu Thr 
420 425 430 

25 Ser Leu Gly Lys Leu Gly Val Asn Ala Thr He Val Thr Lys Phe Asn 
435 440 445 



Arg Gly Ala Asp trp Lys Phe Asn Val Val Leu Tyr Asn Thr Pro Lys 
450 455 460 

Leu Ser Ser Thr Thr Gly Thr Thr Ser Ser Phe Ala He Pro Thr Ala 
465 470 475 480 



Phe Asn Gly Asp Gin Leu Ala Thr Met Glu Ala Val Tyr Val Asn Gly 
35 485 490 495 

Gly Asn Ala Gly Pro His Asn Trp Thr Ser Phe Lys Glu Phe Glu Thr 
500 505 510 

40 Thr Phe Ser Pro Ala Tyr Ser Glu Gly Lys He Lys Leu Gin Gin Ala 
515 520 525 



Phe Phe Asn Glu Val Asn Asp Thr Thr Val Thr Leu Lys Phe Gin Phe 

530 535 540 

Trp Ser Gly Glu He Val Asn Tyr Thr He Lys Lys Ser Gly Ser Thr 

545 550 555 560 



Val Thr Gly Thr Ala Ser 
50 565 
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15 
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(Z) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(v) FRAGMENT TYPE: N-terminal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus lautus 

(B) STRAIN: NCI MB 40250 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 30. .( 1625. 1775) 

(C) IDENTIFICATION METHOD: experimental 
20 (D) OTHER INFORMATION: /partial 

/evidence= EXPERIMENTAL 

/transl_except= (pos: 1446 .. 1458, aa: OTR) 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TTGAAGCGCT GAATTCAGGA GGTTAAATAA TGCGTATTCA TGCAATTCGG CAATCTTGCC 60 

3 0 GTTTGGTATT GACGATGGTT TTGATGCTTG GCTTATTGCT GCCTGTGGGC GCCCCCAAAG 120 

GCTATGCCGC TCCGGCTGTT CCTTTTGGCC AATTAAAAGT TCAGGGCAAT CAATTGGTAG 180 
GACAGTCCGG GCAAGCTGTT CAACTGGTTG GCATGAGCTC ACATGGATTG CAGTGGTATG 240 
GCAATTTCGT CAACAAATCG TCGTTGCAGT GGATGAGAGA TAACTGGGGC ATCAACGTCT 300 
TCCGTGCCGC TATGTATACT TCCGAAGATG GTTACATTAC GGATCCTTCC GTCAAGAACA 360 

4 0 AGGTGAAGGA GGCGGTTCAG GCATCCATCG ATCTGGCCTT GTACGTTATT ATTGACTGGC 420 

ATATCTTGTC TGATGGGAAT CCGAATACGT ACAAAGCGCA ATCGAAAGCG TTCTTCCAAG 480 
AGATGGCCAC ATTGTACGGC AACACGCCGA ATGTAATCTA TGAAATCGCG ACGAGCCCAA 540 
CGGAATGTGT CCTGGGCAGA TGTCAGTCGT CGGAAGAAGT GATCACGGCC ATTCGTTCGA 600 
TTGACCCCGA CGGAGTGGTT ATCGTTGGCA GTCCAACCTG GAGCCAGGAT ATTCATCTGG 660 
50 CGGCCGATAA CCCGGTATCA CATAGCAATG TCATGTATGC GCTTCATTTC TATTCAGGCA 720 
CGCATGGACA GTTTTTGAGA GACCGAATTA CCTATGCGAT GAACAAAGGA GCAGCGATCT 780 
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15 



25 



35 



TCGTTACCGA 


GTGGGGCACC 


"7 T 

AGTGATGCAT CCGGGAACGG CGGGCCGTAT TTGCCTCAGT 


840 


CCAAAGAGTG 


GATCGATTTC 


TTGAATGCTC GCAAGATCAG CTGGGTGAAC TGGTCGCTCG 


900 


CTGATAAAGT 


AGAAACGTCT 


GCTGCTCTTA TGCCAGGTGC ATCGCCTACC GGCGCTGGAC 


960 


CGATGCCCAA 


TTGTCGAATG 


GGCAAATCGG GTTCGCGATC AAATCCGGCA AGCAACTGGA 


1020 


GGCGGCAGGG 


CAATCCAACT 


GCACCGGCTG CCCCTACTAA CCTCTCGGCA AACGGCGGCA 


1080 


ACGCCCAGGT 


ATCATTAACC 


TGGAACGCAG TTAGCGGGGC GACGAGCTAT ACCGTAAAGC 


1140 


GAGCAACGAC 


GAGCGGCGGT 


CCGTACACGA ATGTGGACCG GGGTGTCACG GCGACGAGCT 


1200 


ACACGAACAC 


CGGGCTGACG 


AATGGCACGA CGTATTATTA TGTCGTGAGG GCATCCAATA 


1260 


GCGCGGGCAG 


CAGCGCGAAC 


TCCGCGCAAG CGAGCGCAAC GCCGGCTAGC GGCGGCGCCA 


1320 


GTACGGGGAA 


CCTTGTTGTC 


CAATACAAAG TTGGCGACAC TAGCGCCACG GATAACCAAA 


1380 


TGAAGCCTTC 


CTTTAACATC 


AAGAACAACG GTACAACCCC TGTTAACCTG AGCGGCCTCA 


1440 


AGCTTNNNNN 


NNNNNNNNAA 


AAAGACGGAC CTGCGGATAT GAGCTGCTCG ATCGACTGGG 


1500 


CGCAAATCGG 


CCGAACGAAT 


GTTCTGCTGG CATTCGCTAA CTTTACCGGG AGTAATACGG 


1560 


ATACTTACTG 


TTGTGAGCTA 


AGCTTTAGCT GCACTGCAGG TTCGTATCCC GGCTATGCGT 


1620 


GGACNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 


1680 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 


1740 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN NNNNN 


1775 


(2) INFORMATION FOR SEQ ID N0:6: 





(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1609 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 

(v) FRAGMENT TYPE: N-terminal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus lautus 

(B) STRAIN: NCIMB 40250 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 30.. 1607 
(D) OTHER INFORMATION: 



10 



15 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

TTGAAGCGCT GAATTCAGGA GGTTAAATA ATG CGT ATT CAT GCA ATT CGG CAA 53 

Met Arg lie His Ala He Arg Gin 

:1 5 

TCT TGC CGT TTG GTA TTG ACG ATG GTT TTG ATG CTT GGC TTA TTG CTG 101 
Ser Cys Arg Leu Val Leu Thr Met Val Leu Met Leu Gly Leu Leu Leu 
10 15 20 

PrI 51? SfS G ?° o CC AAA G ? C I AT GCC GCT CCG GCT GTT «T TTT GGC 149 
Pro Val Gly Ala Pro Lys Gly Tyr Ala Ala Pro Ala Val Pro Phe Gly 
25 30 35 4 £ 

rf? TLu ST C f G Sf C AAT CAA 115 GTA GGA CAG TCC GGG C AA GCT 197 
Gin Leu Lys Val Gin Gly Asn Gin Leu Val Gly Gin Ser Gly Gin Ala 

45 50 55 

25 SIT Sf* P G !P7 G ? C £ TG AGC TCA CAT GGA m CAG TGG TAT GGC AAT 245 
Val Gin Leu Val Gly Met Ser Ser His Gly Leu Gin Trp Tyr Gly Asn 

60 65 70 

- n J™ S TC-AAC fAA j CG TCG TTG CAG TGG ATG AGA GAT AAC TGG GGC ATC 293 
30 Phe Val Asn Lys Ser Ser Leu Gin Trp Met Arg Asp Asn Trp Gly He 
75 80 85 

SIS SI? ll C £ GT G f C S CT 2 TG TAT ACT TCC GAA GAT GGT TAC ATT ACG : 341 
Asn Val Phe Arg Ala Ala Met Tyr Thr Ser Glu Asp Gly Tyr He Thr 

90 95 100 

GAT CCT TCC GTC AAG AAC AAG GTG AAG GAG GCG GTT CAG GCA TCC ATC 389 
Asp Pro Ser Val Lys Asn Lys Val Lys Glu Ala Val Gin Ala Ser lie 
105 HO 115 120 

GAT CTG GCC TTG TAC GTT ATT ATT GAC TGG CAT ATC TTG TCT GAT GGG 437 
Asp Leu Ala Leu Tyr Val He He Asp Trp His He Leu Ser Asp Gly 
125 130 135 

45 !JI 22 HI ™ G J AC ^ A G f G S AA TCG AAA GCG TTC TTC CAA GAG ATG 485 
Asn Pro Asn Thr Tyr Lys Ala Gin Ser Lys Ala Phe Phe Gin Glu Met 
140 145 150 

sn G ? C ?E A I 1 " 6 I AC GGC MC ACG CCG AAT GTA ATC TAT GAA ATC GCG ACG 533 
50 Ala Thr Leu Tyr Gly Asn Thr Pro Asn Val lie Tyr Glu He Ala Thr 
155 160 165 



40 



RERLACSUPMreui 



WO 91/10732 



PCT/DK91/00013 



75 



10 



15 



20 



25 



30 



35 



40 



45 



50 



AGC CCA ACG GAA TGT GTC CTG GGC AGA TGT CAG TCG TCG GAA GAA GTG 581 
Ser Pro Thr Glu Cys Val Leu Gly Arg Cys Gin Ser Ser Glu Glu Val 
170 175 180 

ATC ACG GCC ATT CGT TCG ATT GAC CCC GAC GGA GTG GTT ATC GTT GGC 629 
lie Thr Ala He Arg Ser lie Asp Pro Asp Gly Val Val He Val Gly 
185 190 195 200 

AGT CCA ACC TGG AGC CAG GAT ATT CAT CTG GCG GCC GAT AAC CCG GTA 677 
Ser Pro Thr Trp Ser Gin Asp He His Leu Ala Ala Asp Asn Pro Val 
205 210 215 

TCA CAT AGC AAT GTC ATG TAT GCG CTT CAT TTC TAT TCA GGC ACG CAT 725 
Ser His Ser Asn Val Met Tyr Ala Leu His Phe Tyr Ser Gly Thr His 
220 225 230 

GGA CAG TTT TTG AGA GAC CGA ATT ACC TAT GCG ATG AAC AAA GGA GCA 773 
Gly Gin Phe Leu Arg Asp Arg He Thr Tyr Ala Met Asn Lys Gly Ala 
235 240 245 

GCG ATC TTC GTT ACC GAG TGG GGC ACC AGT GAT GCA TCC GGG AAC GGC 821 
Ala He Phe Val Thr Glu Trp Gly Thr Ser Asp Ala Ser Gly Asn Gly 
250 255 260 

GGG CCG TAT TTG CCT CAG TCC AAA GAG TGG ATC GAT TTC TTG AAT GCT 869 
Gly Pro Tyr Leu Pro Gin Ser Lys Glu Trp He Asp Phe Leu Asn Ala 
265 270 275 280 

CGC AAG ATC AGC TGG GTG AAC TGG TCG CTC GCT GAT AAA GTA GAA ACG 917 
Arg Lys lie Ser Trp Val Asn Trp Ser Leu Ala Asp Lys Val Glu Thr 
285 290 295 

TCT GCT GCT CTT ATG CCA GGT GCA TCG CCT ACC GGC GCT GGA CCG ATG 965 
Ser Ala Ala Leu Met Pro Gly Ala Ser Pro Thr Gly Ala Gly Pro Met 
300 305 310 

CCC AAT TGT CGA ATG GGC AAA TCG GGT TCG CGA TCA AAT CCG GCA AGC 1013 
Pro Asn Cys Arg Met Gly Lys Ser Gly Ser Arg Ser Asn Pro Ala Ser 
315 320 325 

AAC TGG AGG CGG CAG GGC AAT CCA ACT GCA CCG GCT GCC CCT ACT AAC 1061 
Asn Trp Arg Arg Gin Gly Asn Pro Thr Ala Pro Ala Ala Pro Thr Asn 
330 335 340 

CTC TCG GCA AAC GGC GGC AAC GCC CAG GTA TCA TTA ACC TGG AAC GCA 1109 
Leu Ser Ala Asn Gly Gly Asn Ala Gin Val Ser Leu Thr Trp Asn Ala 
345 350 355 360 

GTT AGC GGG GCG ACG AGC TAT ACC GTA AAG CGA GCA ACG ACG AGC GGC 1157 
Val Ser Gly Ala Thr Ser Tyr Thr Val Lys Arg Ala Thr Thr Ser Gly 
365 370 375 
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GGT CCG TAC ACG AAT GTG GAC CGG GGT GTC ACG GCG ACG AGC TAC ACG 1205 
Gly Pro Tyr Thr Asn Val Asp Arg Gly Val Thr Ala Thr Ser Tyr Thr 
380 385 390 

5 AAC ACC GGG CTG ACG AAT GGC ACG ACG TAT TAT TAT GTC GTG AGG GCA 1253 
Asn Thr Gly Leu Thr Asn Gly Thr Thr Tyr Tyr Tyr Val Val Arg Ala 
395 400 405 

TCC AAT AGC GCG GGC AGC AGC GCG AAC TCC GCG CAA GCG AGC GCA ACG 1301 
10 Ser Asn Ser Ala Gly Ser Ser Ala Asn Ser Ala Gin Ala Ser Ala Thr 
410 415 420 

CCG GCT AGC GGC GGC GCC AGT ACG GGG AAC CTT GTT GTC CAA TAC AAA 1349 
Pro Ala Ser Gly Gly Ala Ser Thr Gly Asn Leu Val Val Gin Tyr Lys 
15 425 430 435 440 

GTT GGC GAC ACT AGC GCC ACG GAT AAC CAA ATG AAG CCT TCC TTT AAC 1397 
Val Gly Asp Thr Ser Ala Thr Asp Asn Gin Met Lys Pro Ser Phe Asn 
445 450 455 

ATC AAG AAC AAC GGT ACA ACC CCT GTT AAC CTG AGC GGC CTC AAG CTT 1445 
He Lys Asn Asn Gly Thr Thr Pro Val Asn Leu Ser Gly Leu Lys Leu 
450 465 470 

25 NNN NNN NNN NNN NAA AAA GAC GGA CCT GCG GAT ATG AGC TGC TCG ATC 1493 
Xaa Xaa Xaa Xaa Xaa Lys Asp Gly Pro Ala Asp Met Ser Cys Ser He 
475 480 485 

GAC TGG GCG CAA A*TC GGC CGA ACG AAT GTT CTG CTG GCA TTC GCT AAC 1541 
30 Asp Trp Ala Gin He Gly Arg Thr Asn Val Leu Leu Ala Phe Ala Asn " 
490 495 500 

Phi ThS r? G ? GT !* T 5E G 5 AT !E T TAC TGT TGT GAG CTA AGC TTT AGC 1589 
cnf hr Gly Ser Asn Thr Asp Thr Tyr Xys Cys Glu Leu Ser Phe Ser 
35 505 510 515 



20 



40 



50 



520 



TGC ACT GCA GGT TCG TAT CCC GGC TAT GCG TGG AC 1574 
Cys Thr Ala Gly Ser Tyr Pro Gly Tyr Ala Trp 
525 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 526 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
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Met Arg lie His Ala He Arg Gin Ser Cys Arg Leu Val Leu Thr Met 

1 5 10 15 

Val Leu Met Leu Gly Leu Leu Leu Pro Val Gly Ala Pro Lys Gly Tyr 

5 20 25 30 

Ala Ala Pro Ala Val Pro Phe Gly Gin Leu Lys Val Gin Gly Asn Gin 

35 40 45 

10 Leu Val Gly Gin Ser Gly Gin Ala Val Gin Leu Val Gly Met Ser Ser 

50 55 60 



15 



30 



45 



His Gly Leu Gin Trp Tyr Gly Asn Phe Val Asn Lys Ser Ser Leu Gin 

65 70 75 80 

Trp Met Arg Asp Asn Trp Gly He Asn Val Phe Arg Ala Ala Met Tyr 

85 90 95 



Thr Ser Glu Asp Gly Tyr He Thr Asp Pro Ser Val Lys Asn Lys Val 
20 100 105 110 

Lys Glu Ala Val Gin Ala Ser He Asp Leu Ala Leu Tyr Val He He 
115 120 125 

25 Asp Trp His He Leu Ser Asp Gly Asn Pro Asn Thr Tyr Lys Ala Gin 
130 135 140 



Ser Lys Ala Phe J>he Gin Glu Met Ala Thr Leu Tyr Gly Asn Thr Pro 
145 150 155 160 

Asn Val He Tyr Glu He Ala Thr Ser Pro Thr Glu Cys Val Leu Gly 
165 170 175 



Arg Cys Gin Ser Ser Glu Glu Val He Thr Ala He Arg Ser He Asp 
35 180 185 190 

Pro Asp Gly Val Val He Val Gly Ser Pro Thr Trp Ser Gin Asp He 
195 200 205 

4 0 His Leu Ala Ala Asp Asn Pro Val Ser His Ser Asn Val Met Tyr Ala 
210 215 220 



Leu His Phe Tyr Ser Gly Thr His Gly Gin Phe Leu Arg Asp Arg He 
225 230 235 240 

Thr Tyr Ala Met Asn Lys Gly Ala Ala He Phe Val Thr Glu Trp Gly 
245 250 255 



Thr Ser Asp Ala Ser Gly Asn Gly Gly Pro Tyr Leu Pro Gin Ser Lys 
50 260 265 270 
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Glu Trp lie Asp Phe Leu Asn Ala Arg Lys He Ser Trp Val Asn Trp 

280 285 

Ser Leu Ala Asp Lys Val Glu Thr Ser Ala Ala Leu Met Pro Gly Ala 

Ser Pro Thr Gly Ala Gly Pro Met Pro Asn Cys Arg Met Gly Lys Ser 

3 ]L 5 



320 



Gly Ser Arg Ser Asn Pro Ala Ser Asn Trp Arg Arg Gin Gly Asn Pro 
3Z5 330 335 

Thr Ala Pro Ala Ala Pro Thr Asn Leu Ser Ala Asn Gly Gly Asn Ala 
15 345 350 

Gin Val Ser Leu Thr Trp Asn Ala Val Ser Gly Ala Thr Ser Tyr Thr 



365 



Val Lys Arg Ala Thr Thr Ser Gly Gly Pro Tyr Thr Asn Val Asp Arg 

° /u 375 3 

Gly Val Thr Ala Thr Ser Tyr Thr Asn Thr Gly Leu Thr Asn Gly Thr 

" 5yu 395 400 

Thr Tyr Tyr Tyr Val Val Arg Ala Ser Asn Ser Ala Gly Ser Ser Ala 
4Uo 4io 



415 



Asn Ser Ala Gin Ala Ser Ala Thr Pro Ala Ser Gly Gly Ala Ser Thr 
30 425 430 

Gly Asn Leu Val Val Gin Tyr Lys Val Gly Asp Thr Ser Ala Thr Asp 

HOD 440 445 r 

35 Tso ^ LyS Pr ° S6r X! Asn 11 e L * s As " Asn Gly Thr Thr Pro 

455 450 

Val Asn Leu Ser Gly Leu Lys Leu Xaa Xaa Xaa Xaa Xaa Lys Asp Gly 

* /u 475 480 



Pro Ala Asp Met Ser Cys Ser He Asp Trp Ala Gin He Gly Arg Thr 
485 490 495 

Asn Val Leu Leu Ala Phe Ala Asn Phe Thr Gly Ser Asn Thr Asp Thr 
45 505 510 

Tyr Cys Cys Glu Leu Ser Phe Ser Cys Thr Ala Gly Ser Tyr Pro Gly 



525 



Tyr Ala Trp 
50 530 
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CLAIMS 

1. An enzyme which exhibits cell ulase activity, which enzyme is producible 
by a strain of Bacillus spp., NCIMB 40250, or a related Bacillus spp. strain, 

5 or a derivative of said cellulase. 

2. An enzyme according to claim 1, which exhibits an endoglucanase activity 
of at least about 10, more preferably at least about 20, most preferably at 
least about 25, such as about 30, CMC-endoase units (as defined herein) per 

10 mg of total protein under alkaline conditions. 

3. An enzyme according to claim 1, which is active at a temperature of up to 
about 65 - C. 

15 4. An enzyme according to any of claims 1-3, which is an endoglucanase with 
an apparent molecular weight of 75 kD or a cleavage product thereof 
exhibiting endoglucanase activity. 

5. An enzyme according to claim 4, which is an endoglucanase encoded by the 
20 DNA sequence shown in the appended Sequence Listing ID#1, or a modification 
thereof encoding a derivative of said endoglucanase. 

5. An enzyme according to any of claims 1-3, which is an endoglucanase with 
an apparent molecular weight of 56 kD or a cleavage product thereof 
25 exhibiting endoglucanase activity. 

7. An enzyme according to claim 6, which is an endoglucanase encoded by the 
DNA sequence shown in the appended Sequence Listing ID##, or a modification 
thereof encoding a derivative of said endoglucanase. 

30 

8. An enzyme according to any of claims 1-3, which is an endoglucanase with 
an apparent molecular weight of 45 kD or a cleavage product thereof 
exhibiting endoglucanase activity. 
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9. An enzyme according to claim 8, which is an endoglucanase encoded by the 
DNA sequence shown in the appended Sequence Listing ID#6, or a modification 
thereof encoding a derivative of said endoglucanase. 

5 10. An enzyme according to any of claims 1-3, which is an endoglucanase with 
an apparent molecular weight of 60 kD or a cleavage product thereof 
exhibiting endoglucanase activity, or which is an endoglucanase with an 
apparent molecular weight of 56 kD or a cleavage product thereof exhibiting 
endoglucanase activity. 

10 

11. An enzyme according to any of claims 1-3, which is an endoglucanase with 
an apparent molecular weight of 92 kD or a cleavage product thereof 
exhibiting endoglucanase activity. 

15 12. An enzyme which comprises a core region derived from an endoglucanase 
according to any of claims 1-11 combined with a cellulose-binding domain 
derived from another cellulase enzyme, or a core region derived from another 
cellulase enzyme combined with a cellulose-binding domain derived from an 
endoglucanase according to any of claims 1-11. 

20 

13. An enzyme according to claim 12, wherein the core region is derived from 
a cellulase enzyme which does not, in nature, comprise a cellulose-binding 
domain. 

25 14. A DNA construct which comprises a DNA sequence encoding an enzyme 
exhibiting cellulase activity, which enzyme is derivable from a strain of 
Bacillus spp., NCIMB 40250, or a related Bacillus spp. strain, or a 
derivative of said cellulase. 

30 15. A DNA construct according to claim 14, wherein the DNA sequence encodes 
an endoglucanase with an apparent molecular weight of 75 kD. 

16. A DNA construct according to claim 15, wherein the DNA sequence is tjie 
one shown in the appended Sequence Listing ID#1, or a modification thereof 
35 encoding a derivative of said endoglucanase. 
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17. A DNA construct according to claim 14, wherein the DNA sequence encodes 
an endoglucanase with an apparent molecular weight of 56 kD. 

5 18. A DNA construct according. to claim 17, wherein the DNA sequence is the 
one shown in the appended Sequence Listing ID#3, or a modification thereof 
encoding a derivative of said endoglucanase. 

19. A DNA construct according to claim 14, wherein the DNA sequence encodes 
10 an endoglucanase with an apparent molecular weight of 45 kD. 

20. A DNA construct according to claim 19, wherein the DNA sequence is the 
one shown in the appended Sequence Listing ID#6, or a modification thereof 
encoding a derivative of said endoglucanase. 

15 

21. A DNA construct according to claim 14, wherein the DNA sequence encodes 
an endoglucanase with an apparent molecular weight of 92 kD. 

22. An expression vector which carries an inserted DNA construct according 
20 to any of claims 14-21. 

23. A cell which is transformed with a DNA construct according to any of 
claims 14-21 or with an expression vector according to claim 22. 

25 24. A cell according to claim 23, which is a bacterium. 

25. A cell according to claim 24, which is a grampositive bacterium. 

26. A eel 1 according to claim 25, wherein the grampositive bacterium is 
30 selected from the group consisting of Bacillus subtilis . Bacillus 

l icheniformis , Bacillus lentus, Bacillus brevis, Bacillus stearothermnnhii... , 
Baci11us alkalophilus, Bacillus amvloliouefacipn* . Bacillus coaoulans .' 
Bacillus circulans or Bacillus latit.m: 
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27. A cell according to claim 24, which is a gramnegative bacterium, e.g. 
Escherichia col i . 

28. A method of producing an enzyme according to any of claims 1-13, wherein 
5 a host cell according to any. of claims 23-27 is cultured under conditions 

conducive to the production of the endoglucanase or a derivative thereof, and 
the endoglucanase or derivative thereof is subsequently recovered from the 
culture. 



10 29. A method according to claim 28, wherein the endoglucanase is recovered 
in mature form. 

30. A cellulolytic agent capable of degrading amorphous regions of cellulose 
fibres, the agent comprising an enzyme according to any of claims 1-13. 

15 

31. An agent according to claim 30, which comprises a combination of two or 
more cellulases recited in any of claims 1-13, or a combination of one or 
more cellulases recited in any of claims 1-13 with one or more other enzymes 
with eel lulase .activity. 

JO 

32. An agent according to claim 30 or 31, which is in the form of a non- 
dusting granulate, stabilized liquid or protected enzyme. 

33. An agent according to any of claims 30-32, which exhibits an 
!5 endoglucanase activity of 500-10,000 CMC-endoase units per gram of the agent. 



34. An agent according to any of claims 30-33, which is a detergent additi 



ve, 



35. An agent according to claim 34, which additionally comprises another 
0 enzyme such a a protease, lipase and/or amylase. 

36. A detergent composition comprising a cellulolytic agent according to any 
of claims 30-35. 
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37. A detergent composition according to claim 36, which exhibits 
endoglucanase activity of 0.3-400 CMC-endoase units per gram of detergent. 

38. A method of reducing the rate at which cellulose-containing fabrics 
5 become harsh or of reducing the harshness of cellulose-containing fabrics, 

the method comprising treating cellulose-containing fabrics with a 
cellulolytic agent according to any of claims 30-35. 

39. A method according to claim 38, wherein the treatment of the fabrics with 
10 the cellulolytic agent is conducted during soaking, washing or rinsing of the 

fabrics. 

40. A method of treating a coloured, cellulose-containing fabric in order to 
provide colour clarification, the method comprising treating the cellulose- 
containing fabric with a cellulolytic agent according to any of claims 30- 
35. 



41. A method according to claim 40, wherein the treatment of the fabric with 
the cellulolytjc agent is conducted in an aqueous medium during soaking, 

20 washing or rinsing of the fabric. 

42. A method according to claim 41, wherein the aqueous medium exhibits an 
endoglucanase activity of more than about 250 CMC-endoase units per liter of 
the aqueous medium. 

25 
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Fig. 5 
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Fig. 7a 
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